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IMPACT:  Imaging  and  Molecular  Markers  for  Patients  with  Lung  Cancer:  Approaches  with 
Molecular  Targets,  Complementary,  Innovative  and  Therapeutic  Modalities 


INTRODUCTION 

Lung  cancer  is  the  most  prevalent  cancer  worldwide  and  the  leading  cause  of  cancer-related 
mortality  in  both  men  and  women  in  the  United  States.  Conventional  multimodality  therapies 
(surgery,  radiation  and  chemotherapy)  have  reached  a  therapeutic  ceiling  in  improving  the  five- 
year  overall  survival  rate  of  non-small  cell  lung  cancer  (NSCLC)  patients,  clinically  in  large  part 
due  to  chemo-  and  radiation-resistant  locoregional  and  metastatic  spread  but  ultimately  due  to 
poor  understanding  of  the  disease  and  its  resistance  to  the  therapy. 

Lung  cancer  is  a  heterogeneous  disease,  resulting  from  accumulated  genetic  abnormalities  over 
years,  which  thus  requires  a  coordinated  attack  in  a  truly  integrated  fashion  on  multiple  altered 
signal  pathways.  Emerging  targeted  therapy  aims  to  target  key  molecular  abnormalities  in 
cancer  and  has  succeeded  in  some  tumor  types  such  as  chronic  myeloid  leukemia  (CML) 
(Druker  et  al.,  2004;  Druker  and  Sawyers  et  al.,  2001;  Druker  and  Talpaz  et  al.,  2001), 
gastrointestinal  stromal  tumor  (Demetri  et  al.,  2002),  colon  cancer  (Hurwitz  et  al.,  2003),  and 
breast  cancer  (Howell  et  al.,  2005).  Thus,  the  incorporation  of  targeted  therapy  into  conventional 
treatments  appears  to  be  a  new  promising  approach  to  treatment  of  lung  cancer. 


The  program  project  IMPACT  has  proposed  to  integrate  targeted  therapy  in  the  lung  cancer 
research  program  when  initial  clinical  results  showed  disappointing  response  rates  and  survival 
benefit  of  epidermal  growth  factor  receptor  (EGFR)  inhibitor  Gefitinib  (Iressa)  for  non-selected 
lung  cancer  patients  (Herbst  et  al.,  2002,  2003,2004;  Herbst,  2004;  Kris  et  al.,  2003;  Giaccone 
et  al.,  2004).  It  aims  to  validate  molecular  mechanisms  of  targeted  agents  alone  and  in 
combination  with  chemo  and/or  radiation  therapies  in  preclinical  and  clinical  settings.  It  also 
aims  to  develop  effective  molecular  imaging  and  cancer  cell-targeted  peptide-based  delivery 
tools  to  help  improve  efficacy  of  the  targeted  agents.  Specifically,  our  objectives  are: 


•  To  validate  preclinically  and  clinically  several  key  signaling  pathways  and  their  agents  for 
therapeutic  potentials  alone  or  in  combination  with  each  other  or  with  chemo  and  /or 
radiotherapy 

•  To  explore  applications  of  molecular  imaging  for  targeted  therapy  and  identify  cancer  cell- 
targeted  peptides  for  systemic  delivery  of  therapeutic  and  imaging  agents 

•  To  discover  and  evaluate  new  molecular  abnormalities  and  therapeutic  predictors  in  lung 
cancer 

•  To  develop  an  educational  program  for  teens  and  young  adults  for  smoking  risk  and 
resultant  lung  cancer  occurrence. 


IMPACT  is  composed  of  6  research  projects,  1  Biostatistics  Core,  1  Molecular  Pathology  Core, 
1  Molecular  Imaging  Core,  2  career  development  projects,  and  2  developmental  research 
projects.  Here  we  present  their  scientific  progresses  in  the  fourth  grant  year  as  follows. 
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Project  1:  Targeting  epidermal  growth  factor  receptor  signaling  to  enhance  response  of 
lung  cancer  to  therapeutic  radiation. 

(PI  and  co-PI:  Raymond  E.  Meyn,  Ph.D.,  Ritsuko  Komaki,  M.D.) 

In  spite  of  significant  technical  advances  including  intensity-modulated  radiation  therapy  (IMRT) 
and  chemoradiation,  locally  advanced  lung  cancer  continues  to  have  a  dismal  prognosis  as 
many  patients’  tumors  appear  to  be  resistant  to  radiation  therapy.  The  molecular  basis  for 
radiation  resistance  is  not  fully  understood,  but  tumor  cells  have  an  enhanced  survival  response 
that  involves  increased  capacity  for  DNA  repair  and  suppressed  apoptosis.  Both  apoptosis 
propensity  and  DNA  repair  capacity  are  thought  to  be  partly  controlled  by  the  upstream  signal 
transduction  pathways  triggered  by  EGFR  activation,  which  is  constitutively  activated  in  many 
NSCLCs,  and  its  activation  leads  to  a  radiation-resistant  phenotype.  We  hypothesize  that  the 
response  of  NSCLC  to  radiation  can  be  improved  through  the  use  of  inhibitors  of  EGFR 
signaling. 

Aim  1  To  test  the  combination  of  external  beam  radiation  and  the  selective  EGFR- 
tyrosine  kinase  inhibitor  erlotinib  (Tarceva)  in  locally  advanced  NSCLC. 

Summary  of  Research  Findings 

2005-1023,  A  Phase  II  Study  of  Tarceva  (erlotinib)  in  Combination  with  Chemoradiation  in 
Patients  with  Stage  IIIA/B  Non-Small  Cell  Lung  Cancer  (NSCLC),  completed  a  comprehensive 
review  by  our  Institutional  Review  Board  (IRB),  the  US  Department  of  Defense  (DoD), 
Genentech,  and  Food  and  Drug  Administration  (FDA),  and  was  finally  approved  in  November 
2007.  The  trial  was  subsequently  activated  (11/20/07)  and  patient  accrual  has  begun.  An 
amendment  is  pending  to  reduce  total  accrual  from  72  patients  to  48  patients  by  using 
progression-free  survival  rather  than  time  to  progression  as  the  endpoint. 

This  trial  uses  chemoradiotherapy  followed  by  a  molecularly  targeted  treatment  (erlotinib, 
targeting  the  EGFR  tyrosine  kinase)  with/without  radiotherapy  for  stage  III  NSCLC  to  improve 
the  therapeutic  ratio  (i.e.,  increase  malignant  cell  cytotoxicity  without  increasing  normal  cell 
cytotoxicity).  The  primary  objective  is  to  determine  the  efficacy  of  concurrent  chemoradiotherapy 
followed  by  erlotinib/radiation  and  erlotinib  alone,  which  will  be  repeated  for  7  weeks,  then  two 
cycles  of  consolidation  chemotherapy  as  measured  by  progression-free  survival.  Secondary 
objectives  include  determining:  1)  safety,  as  measured  by  the  rate  of  grade  3  or  worse  non- 
hematological  toxicity  (dose-limiting  toxicity,  DLT)  occurring  prior  to  the  beginning  of 
consolidation  therapy  (including  all  toxicities  attributed  to  chemoradiation  occurring  within  90 
days  of  the  start  of  radiation  therapy);  2)  compliance,  which  is  defined  to  be  completion  of 
concurrent  chemoradiation  and  erlotinib/radiotherapy  with  no  more  than  minor  variations;  3) 
response  rate  (complete  and  partial  response  rates);  4)  overall  survival  rates  (one-  and  two-year 
rates,  median  survival);  5)  disease  (local)  control  rate;  6)  association  between  EFGR  expression 
and  toxicity,  response,  overall  survival,  and  progression  (exploratory  analysis);  and  7) 
association  between  EGFR  expression  and  response  correlated  with  imaging  study.  A  total  of 
48  patients  will  be  treated  with  concurrent  chemoradiation  [RT:  63  Gy/35  fractions/7weeks  (+/-  5 
days),  1.8  Gy/  fraction,  a  total  dose  of  63.0  Gy  in  35  fractions  over  7  weeks  plus  Paclitaxel, 
45mg/m2,  and  carboplatin,  AUC=2,  weekly  on  day  1  for  7  weeks]  and  erlotinib  (erlotinib,  150  mg 
p.o.  daily  for  7  weeks,  starting  with  radiotherapy  on  day  2-5  followed  by  erlotinib  150  mg  p.o. 
alone  on  day  6  -7  for  7  weeks).  Patients  will  get  one  month  off  treatment,  followed  by 
consolidation  therapy  of  chemotherapy  alone  (weeks  11-17:  paclitaxel,  200  mg/m2,  and 
carboplatin,  AUC=6,  every  3  weeks  for  two  cycles). 
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Currently,  a  total  of  22  patients  were  enrolled  on  this  protocol  between  November  2007  and 
February  8,  2009,  with  an  average  accrual  rate  of  1.6  patients/  month.  Accrual  rate  for  the  first 
several  months  after  activation  was  lower  than  predicted,  but  has  increased  to  2-4 
patients/month  since  August  2008.  We  anticipate  completion  of  the  study  within  the  next  12 
months;  thus,  a  request  for  a  no-cost  extension  to  continue  to  support  this  work  has  been 
submitted. 

Response 

Fourteen  patients  have  completed  the  treatment  regimen  per  protocol.  Twelve  of  these  patients 
had  a  measurable  response  (RECIST)  to  the  treatment;  seven  patients  had  a  complete 
response  (CR)  and  five  patients  achieved  partial  response  (PR).  One  of  the  patients  exhibited 
disease  progression  during  the  treatment  schedule  and  treatment  was  discontinued.  Another 
patient’s  tumor  response  was  not  able  to  be  assessed  due  to  the  tumor  being  obscured  by 
radiation  pneumonitis.  During  the  follow-up  time  period,  one  patient  developed  a  brain 
metastasis  two  months  after  completion  of  treatment,  with  CR  in  the  primary  disease  site  of  the 
left  lingular  lobe  of  the  lung.  A  second  patient  developed  a  malignant  pleural  effusion  two 
months  after  completion  of  his  treatment  with  PR  in  the  primary  disease  site.  A  third  patient 
failed  in  the  primary  site  where  he  received  radiotherapy,  eight  months  after  completion  of  his 
treatment,  without  any  distant  metastasis.  The  rest  (10/14)  of  patients  are  in  the  follow-up 
process  with  visits  and  scans  for  tumor  assessment  every  three  months.  Eight  of  these  patients 
have  not  yet  returned  for  their  first  follow-up  visit,  so  it  is  too  early  to  summarize  these  results. 

Toxicity 

Toxicity  data  is  available  for  22  patients  either  having  completed  therapy,  or  who  are  presently 
receiving  treatment.  Severe  acute  toxicities  (grade  3  or  higher  according  to  CTC.3)  related  to 
treatments  were  recorded  as  the  following  events: 

•  Rash,  Grade  3  in  3  patients 

•  Acne,  Grade  3  in  2  patients 

•  Diarrhea,  Grade  3  in  1  patient 

•  Pneumonitis,  Grade  3  in  3  patients 

•  Leukopenia,  Grade  3  in  3  patients,  Grade  4  in  1  patient 

•  Neutropenia  Grade  3  in  1  patient,  Grade  4  in  2  patients 

•  Thrombocytopenia,  Grade  3  in  1  patient 

•  Hypomagnesemia,  Grade  3  in  1  patient 

•  Hypokalemia,  Grade  3  in  1  patient 

•  Pneumonia,  Grade  3  in  4  patients 

•  Dehydration,  Grade  3  in  1  patient. 

Protocol  2005-1023  appears  to  have  a  toxicity  profile  that  is  comparable  to  other 
chemoradiotherapy  regimens  reported  in  this  patient  population.  Preliminary  response  data 
seems  favorable,  although  it  is  too  early  in  the  study  to  be  certain  of  whether  this  result  is 
significant.  Continuation  of  the  study  is  warranted. 

Correlation  of  biomarkers  from  the  biopsy  specimen  from  the  primary  site  with  response  is 
underway.  We  are  going  to  analyze  correlation  between  biomarkers  and  responses  as  well  as 
toxicity  when  20  patients  will  complete  treatment.  The  results  will  be  submitted  to  a  major 
international  scientific  meeting  such  as  AACR  or  others.  If  the  rate  of  accrual  of  3-4 
patients/month  continues,  we  will  complete  this  trial  by  the  end  of  this  year. 
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Aim  2  To  test  the  hypothesis  that  activation  of  the  EGFR  pathway  leads  to  radiation 
resistance  in  NSCLC  cells  due  to  an  enhanced  capacity  for  repairing  DNA 
lesions. 

Summary  of  Research  Findings 


As  reported  previously,  2  NSCLC  cell  lines,  A549 
and  HI  299,  displayed  a  significant 
radiosensitization  by  gefitinib  correlating  with  a 
gefitinib-mediated  inhibition  of  radiation-induced 
DNA  double-strand  breaks  (DSBs).  Since  then, 
we  have  extended  this  analysis  to  other  cell  lines 
including  additional  NSCLC  lines  and  normal  cells 
as  well.  Although  some  other  NSCLC  cell  lines 
are  radiosensitized  by  gefitinib,  we  also 
discovered  that  some  lines  are  rendered 
radioresistant  by  gefitinib  treatment.  For  example, 
survival  curves  for  H322  cells  have  shown  that 
gefitinib  induces  a  rather  profound  degree  of 
radioresistance  (Fig.  1).  A  similar  effect  is  seen  in 
several  normal  cell  lines  including  normal  human 
bronchial  epithelial  cells  (HBECs)  (Fig.  2). 


Although  we  haven’t  analyzed  all  of  these  different  cell  lines  for  the  effects  of  gefitinib  on  DSB 
repair,  we  have  examined  the  results  on  the  H322  line,  and  gefitinib  appears  to  accelerate  the 
repair  of  DSBs  compared  to  the  radiation  alone  case  (Fig.  3).  During  this  past  year,  we  have 
begun  an  extensive  investigation  to  uncover  the  basis  for  these  differences  in  the  ability  of 
NSCLC  cell  lines  to  be  radiosensitized  by  EGFR  inhibitors.  As  will  be  presented  in  the  next 
section,  we  hypothesize  that  this  difference  is  related  to  whether  the  cell  line  is  epithelial-like  or 
mesenchymal-like. 
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Aim  3  To  test  the  hypothesis  that  clinically  useful  inhibitors  of  EGFR  signaling 
abrogate  DNA  repair  capacity,  restore  apoptotic  response  and  radiosensitize 
NSCLC  cells. 


Summary  of  Research  Findings 


Several  recently  published  papers  have  reported  that  the  sensitivity  of  NSCLC  cells  to 
antagonists  of  the  EGFR  correlates  with  whether  they  have  undergone  the  epithelial-to- 
mesenchymal  transition  (EMT).  Specifically,  cells  that  have  undergone  EMT  proved  resistant  to 
gefitinib  or  erlotinib  as  a  single  agent.  We  have  now  observed  an  EMT-related  relationship 
amongst  our  NSCLC  lines,  some  of  which  were  the  same  lines  as  in  these  published  reports. 
Cell  lines  that  were  reported  to  be  epithelial-like  (sensitive  to  gefitinib)  were  rendered 
radioresistant  by  the  drug  in  our  studies  and  cell  lines  reported  to  be  mesenchymal-like 
(resistant  to  gefitinib)  were  radiosensitized  by  the  drug  in  our  experiments.  This  suggests  that 
the  activities  of  gefitinib  for  drug-induced  cytotoxicity  and  drug-modulation  of  radiosensitivity  are 
different.  Based  on  the  correlation  between  our  observed  results  and  reported  results  in  the 
literature  regarding  the  same  cell  lines,  we  decided  to  expand  our  panel  of  cell  lines  to  include 
additional  lines  identified  in  those  reports.  Thus,  we  have  now  analyzed  a  total  of  8  NSCLC  cell 
lines  for  their  response  to  a  pre-irradiation  treatment  with  1  pM  of  gefitinib.  Although  complete 
clonogenic  survival  curves  were  generated  for  all  8  lines  with  and  without  gefitinib,  we  have 
summarized  the  results  in  Figure  4  where  we  present  the  SF2  values  for  all  of  these  survival 
curves  (error  bars  omitted  from  Fig.  4).  However,  each  value  is  the  average  of  3  independent 
experiments,  each  plated  in  triplicate.  Statistical  analysis  indicated  that  the  gefitinib  treated  vs. 
non-gefitinib  treated  value  was  significant  at  the  p<.05  level  for  each  cell  line. 
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The  EMT  is  primarily  indicated  and,  in 
at  least  some  cases,  dictated  by  the 
presence  or  loss  of  E-cadherin 
expression.  To  validate  that  the  cell 
lines  in  our  NSCLC  panel  had  or  had 
not  undergone  EMT,  we  performed 
immunoblot  analysis  of  E-cadherin 
expression  for  all  lines  under  study. 
The  results  indicated  that,  as  reported 
for  these  same  lines  in  the  literature, 
the  H322,  H358,  H292  and  Calu3  lines 
express  E-cadherin  and  the  A549, 
HI 299,  H460,  Calu6  and  H520  lines  do 
not  (Fig.  5).  The  HBECs,  as  expected, 
express  E-cadherin  (not  shown).  Cells 
that  have  undergone  EMT  and  have 
lost  E-cadherin  sometimes  express 
mesenchymal  markers  such  as 
vimentin,  although  this  is  not  an 
absolute  marker  for  EMT.  We  assessed  vimentin  in  our  panel  of  cell  lines;  all  of  the  lines  with 
the  mesenchymal  phenotype  express  vimentin  except  A549  cells,  which  express  N-cadherin 
and  fibronectin  (2  other  mesenchymal  markers)  instead.  One  cell  line  with  epithelial  phenotype 
also  expresses  vimentin,  illustrating  that  this  marker  is  not  100%  specific  compared  to  E- 
cadherin  for  EMT  status. 


A549  HI  299  H460  H520 

E-cadherin  negative 


H322  H358  CaLu-3  H292 


E-cadherin  positive 


Cell  line 


1 4  Cocopariun  of  8FZ  Tokm  in  E-cadherin  positive  Ten 
cdl  Jinn  following  exposure  to  Gafitimb.  Ceili  were  exposed  to  Oefilimb  (IjiM)  fix  24 
h  following  which  they  wore  assessed  for  radiaHXBatisrtfcn  by  QtanaganD  odLsarviviL 
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A549  H1299  H460  B52fl  H322  H358  Cakn3  H292 

E-Cadherm 

*  •  J1* 

Vimentin  — ► 

Figure  5.  hnmunoblot  analysis  for  EMT  madras  in  a  pend  of  NSCLC  cdl  lines 

Although  we  have  established  the  relationships  of  EMT  status  to  tumor  cell  radiosensitivity  in 
our  panel  of  8  NSCLC  lines  (the  epithelial  lines,  H358,  H322,  CaLu-3  and  H292,  have  SF2 
values  of  <0.45  and  the  mesenchymal  lines,  A549,  H1299,  H460  and  H520,  have  SF2  values  of 
>0.50),  this  is  an  observed  correlation  and  does  not  prove  an  absolute  relationship  between  E- 
cadherin  expression  and  cell  sensitization.  Therefore,  we  have  begun  a  series  of  experiments 
to  test  whether  restoring  E-cadherin  expression  in  the  mesenchymal-like  lines  sensitizes  them 
to  radiation.  We  have  transfected  HI 299  cells  with  a  CDHt-expression  vector  and  isolated 
clones  based  on  drug-selection  markers  in  the  vector.  Several  clones  were  selected  and 
analyzed  for  E-cadherin  expression.  One  such  clone,  H1299-CDH1  (clone  #1),  has 
subsequently  been  tested  for  radiosensitivity  compared  to  the  parental  line  and  the  results  are 
presented  in  Fig.  6.  Although  this  experiment  will  have  to  be  repeated  and  additional  clones 
tested,  this  result  suggests  that  restoration  of  E-cadherin  expression  in  mesenchymal-like 
NSCLC  cells  produces  a  radiosensitizing  effect. 


A 


E-cadherin 

Actin 


CDH1  clones 


Control  #1  #2  #3 


Figure  6:  Overexpression  of  E-cadherin  makes  cells 
radiosensitive.  (A)  HI 299  cells  were  stably  transfected 
with  a  E-cadherin  expression  vector  and  (B)  assessed  for 
radiation  response  by  clonogenic  cell  survival.  Parental 
HI 299  cells  served  as  controls. 


Dose  (Gy) 


Figure  7:  Effect  of  gefitinib  on  E-cadherin  overexpressing 
NSCLC  cells.  H1299  cells  stably  overexpressing  E- 
cadherin  (CDH1)  were  treated  with  gefitinib  (lpM  for  24 
h)  and  radio  sensitization  assessed  by  clonogenic  cell 
survival.  HI 299  parental  cells  served  as  controls. 


Using  the  same  clone  of  H1299  cells  with  restored  E-cadherin  expression,  described  above,  we 
tested  the  effects  of  a  gefitinib  pretreatment  compared  to  parental  cells.  The  results  of  this 
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experiment  indicate  that  instead  of  being  sensitized  to  radiation,  as  shown  for  the  parental  cells 
in  Figure  1,  the  same  pretreatment  with  gefitinib  exerts  a  radioprotective  effect  on  these  F1 1299- 
CDH1  cells  (Fig.  7).  Again,  these  experiments  will  have  to  be  repeated  and  additional  clones 
produced  and  analyzed.  Moreover,  this  H1299-CDH1  clone  was  compared  to  parental  cells  in 
these  experiments  and  we  are  in  the  process  of  making  empty-vector  control  cell  lines  to  use  in 
further  studies  related  to  these  questions.  E-cadherin  expressing  clones  of  A549  cells  have 
been  produced  and  initial  testing  of  one  clone  suggests  that  while  gefitinib  pretreatment  does 
not  render  the  A549-CDH1  cells  radioresistant,  it  no  longer  induces  a  radiosensitizing  effect 
compared  to  results  in  parental  cells  as  shown  in  Figure  1  (data  not  shown). 

Aim  4  To  test  the  hypothesis  that  targeting  both  EGFR  and  its  downstream  signaling 
pathways  will  have  at  least  an  additive  radiosensitizing  effect  on  NSCLC. 

Summary  of  Research  Findings 

It  is  now  understood  that  resistance  to  EGFR  inhibitors  such  as  gefitinib  and  erlotinib  may  be 
due  to  compensatory  signaling  pathways.  Two  such  pathways  are  now  known:  the  c-Met  and 
IGF1 R  mediated  pathways.  During  this  past  funding  period,  we  have  continued  to  examine  both 
of  these  pathways  in  order  to  understand  why  some  cell  lines  are  resistant  to  radiosensitization 
by  inhibition  of  EGFR.  For  these  experiments,  we  have  used  commercially  available  inhibitors 
of  c-Met  (e.g.,  SU1 1274)  and  of  IGF1 R  (e.g.,  AG1024). 

We  examined  the  ability  of  both  of  these  inhibitors  to  radiosensitize  NSCLC  cell  lines.  Last 

year,  we  reported  that  the  c-Met 
inhibitor  SU1 1274  has  a  significant 
radiosensitizing  effect  on  F1 1299 
cells,  and  we  have  extended  this 
finding  to  H460  cells  (Fig.  8).  This 
finding  is  consistent  with  the 
emerging  hypothesis  that 
resistance  to  radiosensitization 
with  EGFR  inhibition  may  be  due 
to  compensatory  signaling  by  the 
c-Met  receptor,  and  that  inhibition 
of  c-Met  signaling  may  be  a  useful 
strategy  for  radiosensitizing 
NSCLC  cells  either  with  or  without 
combining  with  EGFR  inhibitors. 
Similarly,  we  have  shown  that  the 
HI  299  and  FI358  cells  are 
potentially  radiosensitized  by  the 
IGF1R  inhibitor,  AG1024  (Fig.  9). 
Thus,  inhibition  of  the  IGF1R 
pathway  may  also  be  a  viable 
strategy  for  enhancing  the 
radiation  response  of  NSCLC 
cells. 
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Dose  (Gy)  Dose  (Gy) 

Figure  9:  IGF-1R  inhibitor  radiosensitizes  NSCLC  cells.  H129  and  H358  cells  were  treated  with  AG1024  (2  or  5  (iM 
for  24  h)  and  assessed  for  radiosensitization  by  clonogenic  cell  survival  immediately. 


Aim  5  To  test  whether  the  strategies  developed  in  Specific  Aims  2-4  have  efficacy  in 
a  xenograft  tumor  model. 

Summary  of  Research  Findings 


We  also  investigated  the  ability  of  erlotinib  to  mediate  its  radiosensitizing  effect  in  vivo. 


06  viable  HI 299  cells,  suspended  in 
PBS,  into  the  hind  legs  of  6-8 
week-old  athymic  nude  mice 
(nu/nu;  Charles  River). 
Treatment  was  initiated  when  the 
tumors  reached  100mm3  with 
either  erlotinib  alone  (60mg/kg 
daily  for  a  total  of  nine  doses), 
radiation  alone  (5  Gy),  or  erlotinib 
(60mg/kg  daily)  plus  radiation. 
For  in  vivo  radiation  treatments, 
animals  bearing  xenograft  tumors 
were  irradiated  while  anesthetized 
using  a  60Co  teletherapy  unit. 
Radiation  was  given  24  h  after  the 
last  injection  of  erlotinib.  Tumor 
growth  delay  was  assessed 
following  treatment.  Tumors  were 
measured  every  other  day  in  two 
orthogonal  dimensions,  and  the 
tumor  volume  was  estimated 
assuming  an  ellipsoid  shape. 
Animals  were  sacrificed  when  the 
tumor  volume  reached  1000mm3. 
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A  modest  tumor  growth  delay  was  observed  in  the  group  that  received  erlotinib  plus  radiation 
treatment  when  compared  with  radiation  or  erlotinib  treatments  alone  (Fig.  10). 

Key  Research  Accomplishments 

■  Entered  22  patients  onto  the  erlotinib  (Tarceva)  plus  radiotherapy  for  locally  advanced 
NSCLC  trial  and  completed  evaluation  of  12  of  these  patients. 

■  Discovered  relationship  between  the  epithelial-to-mesenchymal  transition  and 
radiosensitivity  of  NSCLC  cells. 

■  Demonstrated  that  pretreatment  with  gefitinib  exerts  a  radioprotection  of  H1299-CDH1 
cells. 

■  Demonstrated  that  small  molecule  inhibitors  of  both  c-Met  and  IGF-1R  produce  a 
significant  radiosensitizing  effect  on  NSCLC  cells. 

■  Completed  an  assessment  of  the  combination  of  erlotinib  (Tarceva)  and  radiation  in  a 
NSCLC  xenograft  tumor  model. 

Conclusions 


We  conclude  that  the  epithelial-to-mesenchymal  transition  (EMT)  plays  a  significant  role  in 
governing  not  just  the  intrinsic  radiosensitivity  of  NSCLC  cells,  but  also  their  sensitivity  to 
inhibitors  of  the  epidermal  growth  factor  receptor  (EGFR)  and  the  ability  of  such  inhibitors  to 
radiosensitize  these  cells.  It  would  be  useful  to  assess  the  EMT  status  of  patients  treated  with 
these  combinations.  In  spite  of  this  finding,  results  suggest  that  such  combinations  might  be 
useful  in  the  clinic.  In  addition,  we  conclude  that  targeting  other  growth  factor  receptors  such  as 
the  c-Met  and  IG-F1R  receptors  may  be  an  alternative  strategy  to  using  EGFR  inhibitors. 


Project  2:  Molecular  Imaging  of  EGFR  Expression  and  Activity  in  Targeting  Therapy 
of  Lung  Cancer 

(PI  and  co-PI:  Juri  Gelovani,  M.D.,  Ph.D.;  Roy  Herbst,  M.D.,  Ph.D.) 

Aim  1  To  synthesize  novel  pharmacokinetically  optimized  124l  and  1SF-Iabeled 
IPQA  derivatives  for  PET  imaging  of  EGFR  kinase  activity  and  conduct  in 
vitro  radiotracer  accumulation  studies  in  tumor  cells  expressing  different 
levels  of  EGFR  activity. 

During  the  third  year  of  funding  of  Project  2,  our  research  continued  to  be  focused  on  Aims  2 
and  3  of  this  Project.  This  Aim  was  completed  during  the  previous  project  year. 

Aim  2  To  assess  the  biodistribution  (PK/PD)  and  tumor  targeting  by  novel  124l  and 
18F-labeled  EGFR  kinase-specific  IPQA  derivatives  using  PET  imaging  in 
orthotopic  mouse  models  of  lung  cancer  and  compare  in  vivo  radiotracer 
uptake/retention  with  phospho-EGFR  levels  in  situ. 

Summary  of  Research  Findings 

As  part  of  Aim  2,  we’ve  developed  the  fifth-generation  of  more  water-soluble  polyethylene 
glycol  (PEG)-ylated  3-iodo-4-(phenylamino)quinazoline-6-acrylamide  (IPQA)  derivatives 
labeled  with  18F,  which  will  simplify  the  translation  of  EGFR  kinase  imaging  agents  into  the 
clinic  in  comparison  with  124I-JGAP5.  These  novel  fifth  generation  18F-labeled  compounds 
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were  synthesized  by  the  Imaging  Core  D.  The  compound  [18F]fluoro-hexa-PEG-IPQA  was 
developed  based  on  our  previous  studies  to  improve  tracer’s  solubility,  resulting  in  less 
hepato-biliary  clearance  and  a  longer  plasma  circulation  half-time. 


Figure  1.  Time-dependent  accumulation  and  washout  of  [18F]fluoro-hexa-PEG-IPQA  in  different  NSCLC  cell  lines. 
Similar  studies  were  preformed  after  treatment  with  Iressa. 


[18F]fluoro-hexa-PEG-IPQA  was  first  assessed  in  vitro  for  its  accumulation  and  washout 
kinetics  in  human  NSCLC  cells  (PC14,  H441,  H3255,  HI 975)  with  different  EGFR  signaling 
profiles  (Fig.  1).  The  highest  cell-to-medium  ratios  of  the  four  cell  lines  were  observed  at 
approximately  20  minutes  when  a  steady  state  of  post-treated  radiotracer  exposure  was 
reached. 
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Figure  2.  Washout  phases  of  [18F]fluoro-hexa-PEG-IPQA  in  different  NSCLC  cell  lines. 


The  amount  of  accumulation  of  [18F]fluoro-hexa-PEG-IPQA  is  indicative  of  the  levels  of 
EGFR  expression  and  activity  in  corresponding  cell  lines.  As  predicted  for  the  chosen  cell 
lines,  H3255  had  a  highest  uptake  (10  folds)  among  all  cell  lines  (580  ±  109  cell/medium 
ratios)  tested.  Among  the  4  cell  lines,  H3255  was  the  only  one  with  increased  levels  of 
retained  [18F]fluoro-hexa-PEG-IPQA  after  washout  phases  and  increased  incubation  time  of 
the  tracer  (Fig.  2).  The  H3255  washout  phase  can  be  described  as  a  very  rapid  initial  loss 
(50%)  of  the  tracer,  followed  by  a  plateau  with  no  further  washout.  Results  indicated  that 
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binding  of  tracer  to  H3255  cells  is  continual  and  strong.  This  washout  characteristic  was  not, 
however,  observed  in  other  3  cell  lines.  Instead,  they  had  shown  rapid  initial  losses  of 
[18F]fluoro-hexa-PEG-IPQA,  followed  by  slow  decreases  with  increased  incubation  time  of 
the  tracer.  The  regression  lines  of  washout  profiles  of  these  3  cell  lines  were  relatively  flat 
(with  negative  slopes),  indicating  that  significant  uptake  of  [18F]-fluoro-hexa-PEG-IPQA  took 
place  at  the  initial  incubation  stage  with  limited  capacity. 

In  PC14  and  HI 975,  the  profiles  of  accumulation  and  washout  of  [18F]fluoro-hexa-PEG-IPQA 
were  similar  (PC14  11.0  ±  0.9;  H1975  9.1  ±  0.4)  and  the  retention  of  [18F]fluoro-hexa-PEG- 
IPQA  after  washout  were  under  10  cell/medium  ratio.  In  the  EGFR™7,  H441  which  has 
similar  EGFR  expression  as  that  of  H3255,  the  cell-to-medium  ratio  was  more  than  10  times 
lower  than  H3255.  The  residual  level  of  [18F]-fluoro-hexa-PEG-IPQA  didn't  increase  over 
time  after  washout. 

Aim  3  Using  selected  124l  or  18F-labeled  IPQA  derivative,  to  conduct  pre-clinical 
studies  in  animals  with  orthotopic  models  of  lung  cancer  xenografts  with 
different  levels  of  EGFR  expression/activity,  and  to  assess  the  value  of  PET 
imaging  as  the  inclusion  criterion  for  therapy  by  EGFR  inhibitors,  as  well 
as  for  monitoring  the  efficacy  of  treatment  with  EGFR-targeted  drugs. 

In  vivo  [18F]fluoro-hexa-PEG-IPQA  PET  imaging  of  the  H3255  subcutaneous  tumor 
xenograft  demonstrated  superior  binding  of  the  tracer  (Figs.  3  and  5).  MicroPET  images  in 
mice  provided  the  heterogeneous  accumulation  of  [18F]fluoro-hexa-PEG-IPQA  in  H3255  s.c. 
tumor  xenografts.  Hot  spots  and  uptake  layers  could  be  identified  in  the  central  and 
peripheral  regions  of  the  tumor  (Fig.  3,  left  middle  panel,  axial  image  at  60  min  post¬ 
administration).  The  accumulation  of  [18F]fluoro-hexa-PEG-IPQA  displayed  a  one-phase 
exponential  association  kinetics  in  the  first  10-15  minutes.  The  accumulation  reached 
equilibrium  at  ~  20  minutes  (2.24  ±  0.11  %ID/g)  after  administration  of  radiotracer. 
Thereafter,  the  accumulation  slowly  increased  till  the  last  imaging  time  point  (55-65  min). 
This  seemed  to  indicate  that  the  binding  between  [18F]fluoro-hexa-PEG-IPQA  and 
EGFRL858R  was  on  going  during  the  entire  imaging  period.  The  optimal  tumor-to-muscle 
concentration  ratio  was  2.07  ±  0.15  and  tumor-to-blood  ratio  was  1.10  ±  0.21.  We  recorded 
a  measurable  difference  between  these  results  and  the  [18F]fluoro-hexa-PEG-IPQA  PET 
imaging  of  H441  tumor  xenograft.  Although  H441  has  the  same  magnitude  of  EGFR 
expression,  it  has  lower  EGFR  activity  as  compared  with  H3255.  H441  reached  equilibrium 
in  the  first  3-6  minutes  post  injection  at  a  level  of  70  -  75%  accumulation  of  that  in  H3255, 
and  maintained  a  similar  slow  uptake  over  the  subsequent  period.  PC14  and  H1975,  the 
negative  controls  in  this  study,  showed  much  lower  uptake  ratio  which  is  consistent  with  the 
results  observed  in  cellular  studies.  In  summary,  the  accumulation  of  H3255  was  1.4,  6.3 
and  7.6  folds  more  than  that  of  H441,  PC14  and  HI 975  (p  <  0.001;  paired  student  t-test), 
respectively. 

After  treatment  with  Iressa,  H3255,  the  most  sensitive  cell  line,  was  significantly  inhibited. 
The  uptake  ratio  decreased  from  2.27  before  treatment  to  0.65%  ID/g  (at  15  min  post¬ 
administration)  after  treatment  Compared  to  H441  control  group  (1.89  %ID/g),  there  was 
only  15%  uptake  ratio  left  in  H441-lressa  group  (0.49  %ID/g).  No  significant  difference  was 
observed  in  the  PC14  and  HI 975  before  and  after  Iressa  treatment. 

The  rate  of  [18F]fluoro-hexa-PEG-IPQA  accumulation  by  tumor  xenografts  in  mice,  ki,  can  be 
obtained  from  Patlak  plot  shown  in  Figure  4.  The  ki  values  were  determined  between  5-20 
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minutes  (enzymatic  phase)  after  radiotracer  administration  and  they  corresponded  to  the 
EGFR  expression  and  activity. 

Quantitative  autoradiography  (QAR) 

QAR  distribution  was  validated  by  whole-body  quantitative  autoradiography  (Fig.  3, 
transverse  section);  high  resolution  QAR  images  demonstrated  an  impressive  difference  in 
the  accumulation  among  the  four  s.c.  tumor  xenograft  models  (Fig.  3,  left  middle  panel). 
H3255  (3.01  ±  0.84)  had  the  highest  tracer  accumulation  and  it  was  followed  by  H441  (2.53  ± 
0.15),  H1975  (1.97  ±  0.80)  and  PC14  (1.59  ±  0.58).  High  levels  of  radioactivity  were  also 
observed  in  brown  fat,  skin,  liver,  bone  marrow,  but  considered  relatively  lower  in  heart, 
skeletal  muscle  and  spinal  cord.  In  contrast,  the  Iressa-treated  group  presented 
approximately  4  times  lower  uptake  in  H3255  (from  3.01  to  0.78  %  I  D/g )  and  approximately  3 
times  lower  in  H441  (from  2.53  to  0.92  %ID/g).  Similar  accumulation  was  observed  in  PC14 
and  HI 975  before  and  after  treatment  with  Iressa  (Fig.  3,  lower  middle  panel). 


Control 


%ID/g 


UMQ7C  DP.1  A  U1Q7C  A 


Figure  3:  Representative  PET  and  QAR  images  of  [18F]fluoro-hexa-PEG-IPQA  in  mice 
bearing  different  NSCLC  cell  lines. 
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Figure  4.  Patlak  plots  of  [18F]-PEG6-IPQA  in  mice  bearing  different  NSCLC  cell  lines.  The  accumulation  rate,  ki, 
was  calculated  based  on  the  input  function  obtained  from  blood. 


Figure  5.  The  dynamics  of  accumulation  and  clearance  of  [18F]fluoro-hexa-PEG-IPQA  derived  radioactivity  (%ID/g)  in  tumors, 
individual  organs,  and  tissues. 
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After  2  hours  administration  of  [18F]fluoro-hexa-PEG-IPQA,  radioactivity  was  well  distributed  into 
organs  (Table  1).  The  highest  uptake  was  observed  in  small  intestine  (8.12  ±  0.13  %ID/g)  and  it 
was  followed  by  kidneys  (4.58  ±  0.82  %ID/g),  liver  (3.30  ±  0.28  %ID/g),  and  the  lungs  (3.25  ± 
0.29  %ID/g).  The  distribution  of  tracer  in  different  tumors  also  corresponded  to  the  level  and 
magnitude  of  EGFR  expression  and  activity  (H3255  2.55  ±  0.58,  H441  2.20  ±  0.40,  PC14  1 .86  ± 
0.39,  HI 975  1 .86  ±  0.24  %ID/g).  In  the  blocking  study,  an  excess  of  Iressa  was  administered  1 
hour  before  injection  of  radiotracer.  We  observed  higher  radioactivity  in  the  majority  of  examined 
organs  (Fig.  5). 


%lp/g  {Average  ±Sft) 


Organ 

Control 

Iressa 

Blood 

1.99  ±0.07 

225  ±0.07 

liver 

3.3010,28 

3.56  ±0.07 

Kidney 

4.58  :t  0.82 

623.1 1.98 

Small  intestine 

8.12  Jt  0.13 

51112.59 

Large  intestine 

2.03  ±0.03 

201  ±0.58 

Spleen 

1.88  ±0.15 

2001-0.30 

PaitLTttLS 

1.2710.09 

1.07  1  024 

Lung 

3.2S  10.29 

3.5211.84 

Heart 

1.601  0.01 

1.77  ±0.24 

Stomach 

1.92  ±0.15 

1.78  ±0.1 2 

Drain 

1.44  ±001 

1.29  ±0.27 

Hone 

145 10.29 

1.1410.33 

Muscle 

12910.01 

1.421020 

Skin 

1.48+0.01 

1.79+0.34 

*0255 

2.55+0.58 

2.34+0.36 

Hill 

22010.40 

2.6210.48 

PCI  4 

1.8010.39 

24210.42 

11*975 

1.86+0.24 

1.36+0.16 

Table  1.  Radioactivity  levels  (%ID/g)  in  different  tissues  at  the  end  of 
PET  imaging. 


Study  of  pharmacokinetics,  biodistribution,  radiation  dosimetry,  and  metabolites  of  f— Flfluoro- 
hexa-PEG-IPQA  in  nonhuman  primates 

One  of  the  most  important  pre-clinical  studies  for  a  novel  PET  imaging  tracer  in  the 
development  process  is  to  evaluate  the  tracer  in  nonhuman  primates.  During  this  study,  the 
pharmacokinetics,  biodistribution,  excretion  pathway,  metabolites,  and  radiation  dosimetry  of  the 
tracer  can  be  assessed.  Based  on  these  results,  the  dosimetry  for  the  first  cohort  in  human 
phase  I  can  be  calculated  to  improve  the  overall  safety  of  the  study.  We  have  completed  this 
study  in  six  rhesus  macaques.  Results  of  the  study  constitute  an  important  part  in  the 
investigational  new  drug  (IND)  application  to  the  Food  and  Drug  Administration  (FDA)  for  their 
approval  of  our  phase  I  study. 


Six  healthy  rhesus  macaques  were  injected  intravenously  with  [18F]fluoro-hexa-PEG-IPQA 
(5mCi/animal).  Dynamic  imaging  was  performed  on  a  PET/CT  scanner  for  the  first  30  minutes 
covering  the  thoracic-abdominal  area,  and  it  was  followed  by  whole-body  static  imaging  at  30, 
60,  90,  120,  and  180  minutes.  The  biodistribution  and  radiation  dosimetry  estimates  were 
obtained  from  blood  sampling  and  volume  of  interest  analysis  (VOI)  data  measured  on  PET/CT 
images. 


Based  on  data  from  the  first  six  rhesus  macaques  available  to-date,  [18F]fluoro-hexa-PEG-IPQA 
exhibited  a  rapid  redistribution  after  i.v.  injection  and  a  relatively  fast  blood  clearance  with  half- 
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times  of  1 .4  minutes  (mono-exponential  fitting)  via  both  hepatobiliary  and  renal  pathways.  Data 
indicated  that  [18F]fluoro-hexa-PEG-IPQA  degraded  to  radioactive  metabolite  with  a  T1/2~  4min 
post  administration  (Fig.  6).  Inferior  vena  cava,  heart,  lung,  liver,  kidney,  gall  bladder,  small 
intestine,  upper  large  intestine,  and  urinary  bladder  were  visually  identified  as  organs  with 
moderate  to  high  tracer  uptakes  (Fig.  7).  The  critical  organ  was  the  gallbladder  (0.470 
mSv/MBq);  other  organs  with  higher  radiation  dose  were  the  kidney  (0.0999  mSv/MBq),  small 
intestine  (0.0694  mSv/MBq),  upper  large  intestine  wall  (0.0495  mSv/MBq),  and  liver  (0.0425 
mSv/MBq).  Lung  tissue  exhibited  low  uptake  of  18F-PEG6-IPQA. 


The  residence  time  of  [18F]fluoro-hexa-PEG-IPQA  for  each  source  organ  was  calculated  from 
whole-body  PET  images.  The  persistent  radioactivity  accumulation  in  the  gall  bladder,  liver, 
small  intestine,  and  kidneys  resulted  in  the  highest  residence  organ  time.  There  was  no 
observable  evidence  of  acute  cardiac  toxicity  based  on  EKG,  and  no  acute  or  delayed  systemic 
toxicity  based  on  blood  hematological  and  chemical  analyses  was  measured.  The  dose  for  the 
first  cohort  of  humans  in  the  phase  I  trial  will  be  based  on  the  dosimetry  calculation  from  this 
non-human  primate  study.  The  radiation  exposure  to  critical  organs  in  humans  will  be 
administered  at  doses  lower  than  the  regulated  limits. 
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Figure  7:  Time-activity  distribution  of  [18F]fluoro-hexa-PEG-IPQA  in  a  non-human  primate. 


Material  manufacturing 

We  have  initiated  manufacturing  the  non-radio  labeled  (cold)  fluoro-hexa-PEG-IPAQ  and  the 
precursor  with  Advanced  Biomedical  Compounds  (ABX).  The  synthesis  schemes  of  the  cold 
compound  were  developed  by  our  chemistry  team  (Fig.  8),  and  materials  will  be  delivered  to  us 
in  late  March  2009  for  use.  The  cold  fluoro-hexa-PEG-IPQA  (compound  1  from  schemes  1&2) 
will  be  used  in  the  IND-directed  toxicology  study  at  Charles  River  Laboratory  (CRL).  The 
precursor  (compound  9  from  scheme  3)  will  be  used  to  manufacture  18F-fluoro  hexa  PEG-1  PQA 
clinical  dose  for  phase  I  study. 
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We  have  finalized  with  CRL  the  toxicology  protocol  (Table  2)  for  the  proposed  rodent  study.  We 
incorporated  the  FDA’s  input  resulting  from  a  most  current  pre-IND  conference  we  had  with  the 
FDA  reviewers.  To  prepare  the  toxicology  study  for  initiation  as  soon  as  the  material  is 
available  from  ABX,  we  have  sent  CRL  our  analytical  method  to  begin  study  preparations  based 
on  the  properties  of  our  compound.  Results  from  this  study  constitute  part  of  the  pharmacology 
and  toxicology  section  of  the  IND  in  preparation. 
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Title: 

A  single-dose  IV  study  of  [18F]flouro-hexa-PEG-IPQA  in  rats 
with  a  14-day  recovery  period 

Study  site: 

Charles  River  facility,  preclinical  services  Arkansas 

Compliance: 

GLP 

Species/Strain: 

Sprague-Dawley  rats 

Dose  Formulation 

Preparation: 

Standard  dose  formulation  prepared  once.  Standard  sample 
collection  for  concentration  verification  once 

Dose  Formulation  Analysis: 

Analysis  on  1  occasion  using  validated  HPLC  methodology 

Dosing  regimen: 

Single  dose  on  Study  Day  1 . 

Route  of  Administration: 

Intravenous  ( 

bolus)  dose  administration 

Test  system: 

Group 

Dose 

Number  of  animals 

Male 

Female 

1 

Control 

10  M  +  5  R 

10  M  +  5  R 

2 

100  times 

10  M  +  5  R 

10  M  +  5  R 

M=  main  stud 
R=  Recovery 

y  animals  sacrificed  on  Study  Day  2. 
animals  sacrificed  on  Study  Day  15. 

Spares: 

10  animals 

Total  population: 

70  animals 

Age: 

Approximately  5-7  weeks  at  receipt 

Source: 

Charles  River 

Pretreatment  period: 

Approximately  10  days. 

Mortality  observations: 

Mortality/viability  observations  will  be  conducted  twice  daily  on 
all  animals. 

Detailed  observations: 

Daily  on  all  animals. 

Body  weight: 

Daily  on  all  animals 

Special  Assessments: 

General  examination  by  veterinarian  of  all  animals  for  study 
selection. 

Clinical  Pathology 
parameters: 

Coagulation,  hematology,  and  clinical  chemistry  taken  from  all 
animals  near  termination. 

Toxicokinetics  sample 
collection: 

No  bioanalytical  samples  obtained. 

Terminal  procedures: 

Full  gross  necropsy  with  organ  weights  and  tissue  collection 
on  all  main  and  recovery  animals. 

Histopathology: 

Standard  histopathology  on  65  tissue  sections  (including 
injection  site)  on  60  main  and  recovery  animals. 

Audited  draft  report: 

Audited  draft  report  provided  in  approximately  12  weeks  from 
the  time  of  in-life  completion. 

Table  2.  IND-directed  toxicology  study  protocol 


cGMP  manufacturing 

We  have  continued  improvements  on  lab-scale  radiosynthesis  (Fig.  9)  so  that  it  is  readily  scaled 
up  and  adopted  by  the  cGMP  manufacturing.  We  are  working  on  an  automated  cGMP 
manufacturing  process  using  GE  Tracerlab  with  product  specific  hardware  modifications.  This 
automated  process  will  be  fully  validated  to  meet  cGMP  requirements,  and  all  QC  methods  for 
product  release  will  also  be  validated.  Results  from  this  work  constitute  the  chemistry, 
manufacturing,  and  control  (CMC)  section  of  the  IND  in  preparation. 
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The  development  of  a  clinical  trial  protocol  is  underway,  which  will  be  led  by  Dr.  David  Stewart , 
M.D. (Deputy  Chair,  Thoracic/Head  and  Neck  Medical  Oncology).  Our  research  team  held  a 
preliminary  meeting  with  oncologists  to  discuss  patient  recruitment  and  study  objectives.  The 
primary  and  secondary  objectives  of  the  study  are  as  follows: 

Primary: 

•  To  determine  the  optimum  dosimetry  of  [18F]fluoro-hexa-PEG-IPQA  injection  based  on 
critical  organ  safety  and  detection  sensitivity. 

•  To  obtain  data  on  [18F]fluoro-hexa-PEG-IPQA  distribution,  pharmacokinetics,  radiation 
dosimetry,  and  metabolites. 

•  To  assess  the  safety  of  a  single  intravenous  administration  of  [18F]fluoro-hexa-PEG- 
IPQA  in  subjects  with  lung  carcinoma. 

Secondary: 

•  To  obtain  preliminary  data  on  the  feasibility  of  detection  of  both  primary  and  metastatic 
tumor  lesions  in  lung  carcinoma  using  [18F]fluoro-hexa-PEG-IPQA  as  compared  to 
standard  of  care  modalities. 

•  To  correlate  the  magnitude  of  tumor  [18F]fluoro-hexa-PEG-IPQA  uptake  and  retention 
with  EGFR  expression  and/or  treatment  response. 

Additional  meetings  will  be  held  to  discuss  and  refine  inclusion  and  exclusion  criteria,  imaging 
procedures,  safety  monitoring,  etc,  to  fully  complete  the  protocol.  We  anticipate  activation  of 
the  protocol  in  late  summer  or  early  fall  of  2009;  thus,  a  request  for  a  no-cost  extension  to 
continue  to  support  this  work  has  been  submitted. 
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Key  Research  Accomplishments 

•  Developed  the  fifth-generation  of  more  water-soluble  polyethylene  glycol  (PEG)-ylated  3- 
iodo-4-(phenylamino)quinazoline-6-acrylamide  (IPQA)  derivatives  labeled  with  18F  to  simplify 
the  translation  of  EGFR  kinase  imaging  agents  in  the  clinic  in  comparison  with  124I-JGAP5. 

•  Discovered  [18F]fluoro-hexa-PEG-IPQA  has  less  hepato-biliary  clearance  and  a  longer 
plasma  circulation  half-time. 

•  Completed  the  evaluation  of  [18F]fluoro-hexa-PEG-IPQA  as  a  novel  PET  imaging  tracer  in 
six  rhesus  macaques. 

•  Initiated  manufacturing  the  non-radio  labeled  (cold)  fluoro-hexa-PEG-IPAQ  and  the 
precursor  with  Advanced  Biomedical  Compounds  (ABX).  The  precursor  (compound  9  from 
scheme  3)  will  be  used  to  manufacture  18F-fluoro  hexa  PEG-IPQA  clinical  dose  for  phase  I 
study. 

•  Finalized  the  toxicology  protocol  for  the  proposed  rodent  study  with  Charles  River  Labs. 

•  Improved  lab-scale  radiosynthesis  to  meet  cGMP  manufacturing  guidelines. 

•  Held  a  preliminary  meeting  with  oncologists  to  discuss  patient  recruitment  and  study 
objectives  for  a  future  clinical  trial. 

•  Presented  study  results  at  the  2008  World  Molecular  Imaging  Congress. 

Conclusions 


The  development  of  [18F]fluoro-hexa-PEG-IPQA  into  a  clinical  trial  has  been  actively  pursued 
with  clearly  defined  milestones  and  timelines.  Studies  will  focus  on  determining  the  optimum 
dosimetry  of  [18F]fluoro-hexa-PEG-IPQA  injection  based  on  critical  organ  safety  and  detection 
sensitivity.  Data  will  be  obtained  on  agent  distribution,  pharmacokientics,  radiation  dosimetry, 
and  metabolites  to  build  upon  our  current  knowledge  base  and  to  shape  future  studies.  We 
have  initiated  all  fronts  of  the  IND  development  work  to  expedite  the  translational  process. 


Project  3:  Targeted  Peptide-based  Systemic  Delivery  of  Therapeutic  and  Imaging  Agents 
to  Lung  Cancer 

(PI  and  co-PI:  Renata  Pasqualini,  Ph.D.,  Wadih  Arap,  M.D.,  Ph.D.) 

The  studies  outlined  in  this  proposal  focus  on  the  use  of  peptide  sequences  with  selective  lung 
tumor-targeting  properties.  We  will  seek  to  validate  these  probes  as  delivery  vehicles  in  drug 
and  gene-targeting  approaches.  This  approach  directly  selects  in  vivo  for  circulating  probes 
capable  of  preferential  homing  into  tumors.  The  strategy  will  be  to  combine  homing  peptides  in 
the  context  of  phage  as  gene  therapy  vectors.  Given  that  many  of  our  peptides  also  target 
angiogenic  vasculature  in  addition  to  tumor  cells,  these  studies  are  likely  to  enhance  the 
effectiveness  of  therapeutic  apoptosis  induction  and  imaging  technology. 


Aim  1  To  select  peptides  targeting  primary  and  metastatic  tumors  in  lung  cancer 
patients. 

Summary  of  Research  Findings 

Phage  display  screening  on  human  cancer  cells  from  the  NCI-60  panel  of  cell  lines  suggested 
that  the  tripeptide  motif  GGS  specifically  targets  human  lung  cancer  cells. 
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Standard  BLAST  homology  search  and  detailed  protein  sequence  alignment  of  peptides 
displaying  the  motif  GGS  indicated  members  of  the  ephrin  family  of  proteins  as  the  proteins 

potentially  mimicked  by  GGS- 
containing  peptides.  To  validate 
this  hypothesis,  we  tested  a 
series  of  phage  displaying 
distinct  peptides  containing  the 
GGS  domain  for  its  ability  to  bind 
to  EphA5  receptor.  EphA4  and 
BSA  were  used  as  controls 
(Fig.  1 ).  Phage  displaying  the 
peptide  sequences 

CSGIGSGGC  and 

CRFESSGGC  showed  specific 
binding  to  the  candidate  receptor 
and  were  selected  for  further 
characterization.  Competition 
assays  with  synthetic  peptide 
confirmed  binding  specificity 
(Fig.  2). 


Figure  1: 
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Figure  2:  Binding  of  targeted  phage  is  inhibited  by  synthetic  peptides.  Black  line  indicate  specific  inhibition. 
The  unrelated  peptide  CARAC  and  insertless  phage  were  used  a  s  controls.  A,  CSGIGSGGC-phage. 

B,  CRFESSGGC-phage. 
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We  also  tested  binding  of  the  selected  phage  to  the  endogenous  receptor  expressed  on  the 
surface  of  lung  cancer  cells.  Receptor  expression  was  first  confirmed  by  immunofluorescence  of 
two  lung  cancer  cell  lines:  H460  and  Hop92.  The  ovarian  cancer  cell  line  OVCAR-3  and  EphA4 
were  used  as  negative  controls.  As  shown  in  Figure  3A,  lung  cancer  cell  lines  express  high 
levels  of  EphA5  on  their  surface.  Moreover,  phage  binding  was  specific  to  EphA5-expressing 
cell  lines  (Fig.  3B). 


Lung  cancer  cell  lines  Control  cell  line 


NCI-60  cell  line 

Figure  3:  (A)  immunofluorescence  staining  of  lung  cancer  cells  with  anti-EphA5  receptor.  The  ovarian  cancer  cell  line 
OVCAR-3  was  used  as  negative  control  for  EphA5  expression.  (B)  binding  of  EphA5 -targeting  phage  to  lung  cancer 
cells. 


In  vitro  functional  properties  of  EphA5-tarqetinq  peptides 

We  established  in  vitro  assays  to  study  the  biological  effects  of  EphA5  targeting  peptides  on 
human  lung  cancer  cells.  Receptor  phosphorylation  was  induced  with  the  mimicked  ligand  in 
presence  or  absence  of  the  competing  peptide.  We  observed  inhibition  of  EphA5 
phosphorylation  in  the  presence  of  both  CRFESSGGC  and  CSGIGSGGC  synthetic  peptides 
(Fig.  4). 


Figure  4:  Phosphorylation  of  EphA5  receptor  is  inhibited  by  targeted  peptides  (peptide 
41:  CRFESSGGC;  peptide  94:  CSGIGSGGC).  EPHAN5:  ligand  candidate.  BSA  was 
used  as  negative  control.  9S 


Targeted  peptides  were 
able  to  inhibit 
phosphorylation  of 

ERK1/2  (p44/p42)  (Fig. 
5),  indicating  suppression 
of  the  MAP  kinase 
pathway  and  the 
involvement  of  EphA5  in 
the  maintenance  of  lung 
cancer  cells  proliferation 
and  survival.  The  same 
effect  was  observed  in 
the  phosphorylation  of 
JNK,  supporting  the  role 
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of  EphA5  in  tumor  cell  proliferation/survival.  Consistently,  we  did  not  observe  alterations  in  the 
phosphorylation  of  pathways  involved  in  cell  migration  (phosphorylation  of  p38). 


Figure  5:  Activation  of  MAP  kinase  pathway  was  analyzed  in  presence  or  absence  of  targeted  peptides.  Arrows  point  to 
phosphorylated  ERK1/2  and  JNK.  Inhibition  of  phosphorylation  was  observed  in  both  cases.  We  did  not  detect  alterations  in 
the  phosphorylation  levels  of  P38. 


As  mentioned  above,  peptides  selected  from  phage  libraries  have  allowed  the  cloning  of 
vascular  receptors  in  non-proliferating  and  in  angiogenic  blood  vessels.  Although  the  biological 
basis  for  vascular  heterogeneity  is  unknown,  peptides  selected  by  homing  to  blood  vessels  in 
mouse  models  have  been  used  by  several  groups  as  carriers  to  guide  the  delivery  of  cytotoxic 
drugs,  pro-apoptotic  peptides,  metalloprotease  inhibitors,  cytokines,  fluorophores,  and  genes. 
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Generally,  coupling  of  a  given  entity  to  homing  peptides  yields  targeted  compounds  that  are 
more  effective  and  less  toxic  than  the  parental  entity.  Moreover,  vascular  receptors 
corresponding  to  the  selected  peptides  have  been  identified  in  blood  vessels  of  normal  organs 
and  tumors.  These  results  show  that  it  is  possible  to  develop  therapeutic  strategies  based  on 
selective  expression  of  vascular  receptors. 

Aim  2  To  validate  receptors  for  targeting  human  lung  cancer. 

Summary  of  Research  Findings 

Homing  of  targeted  phage  to  orthotopic  human  lung  cancer  model. 

We  examined  the  expression  of  EphA5  in  samples  of  human  lung  cancer  (Fig.  6).  We  observed 
over-expression  of  EphA5  in  non-small  lung  adenocarcinoma  epithelium  (indicated  by  the 
arrows). 

Second,  to  evaluate  the  targeting  properties  of  the  selected  peptides  in  vivo,  we  administered 
targeted  phage  intravenously  into  human  lung  cancer-bearing  mice.  Lung  and  control  organs 
were  harvested  and  examined  by  IHC  for  presence  of  phage  particles  after  1  h,  6h,  and  24h  of 
systemic  circulation.  We  observed  specific  homing  of  phage  to  lung  cancer  colonies  in  all 


Figure  6:  Human  lung  adenocarcinoma  (pictured  left)  and  control  tissue  (prostate,  pictured  right). 


time  points  (Fig.  7,  arrows).  We  did  not  detect  phage  in  normal  lung;  uptake  of  phage  was  found 
to  be  restricted  to  lung  cancer  cells.  Insertless  phage  was  used  as  negative  control  and  showed 
only  background  staining  (Fig.  7). 
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Figure  7:  Homing  of 
targeted  phage  to 
orthotopic  model  of 
human  lung  cancer. 
Arrows  point  to  cancer 
cells  targeted  by 
phage.  Insertless  phage 
was  used  as  negative 
control  and  showed 
only  background 
staining. 


Treatment  of  orthotopic  human  lung  cancer  with  BMTP-11  and  BMTP-78. 

Here  we  analyzed  the  effect  of  BMTP-1 1  and  BMTP-78  in  human  lung  cancer-bearing  mice. 
Animals  (15  animals/group)  were  treated  weekly  with  lOmg/kg  of  each  compound  (ip)  for  the 
period  of  4  weeks.  Control  group  did  not  receive  any  type  of  treatment.  At  the  end  of  treatment, 
lungs  and  control  organs  were  collected  for  (i)  tumor  colony  count  and  (ii)  tissue  weight.  We 
observed  significant  reduction  of  both  tissue  weight  and  tumor  growth  in  animals  treated  with 
BMTP-11  and  BMTP-78  (Fig.  8,  p<0.05).  Staining  of  apoptotic  cells  demonstrated  specific  killing 
of  lung  cancer  cells  in  animals  treated  with  both  BMTP-1 1  and  BMTP-78  (Fig.  9). 

Targeted  anti-cancer  drug  development  program:  BMTP-11  IND  and  BMTP-78  pre-IND 

Our  first  MDACC-sponsored  IND,  based  on  BMTP-11,  has  received  a  “safe  to  proceed”  status 
by  the  FDA  in  January  2009.  The  first-in-human  clinical  trial  has  a  projected  activation  date  of 
March  2009.  Our  second  cancer  IND  filing  with  the  FDA  will  focus  on  BMTP78.  These 
peptidomimetics  are  targeted  to  the  IL-11R  and  to  GRP  78.  Extensive  studies  have  been 
performed  and  clearly  established  the  relevance  of  these  targets  in  human  cancer  tissue 
samples  from  nearly  every  type  of  tumor  evaluated  (including  lung  cancer).  These  studies 
indicate  that  these  targets  are  suitable  for  delivery  of  therapies  and  imaging  agents.  Extensive 
toxicology  work  has  been  completed  in  mice  and  cynomolgus  monkeys.  Efficacy  studies  in  lung 
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cancer  models  have  been  extensively  validated.  We  expect  the  upcoming  clinical  studies  to  be 
informative  and  establish  a  firm  foundation  for  evolving  these  drugs  into  Phase  I  I/I  1 1  trials. 

Merging  tumor  targeting  and  molecular-genetic  imaging  into  an  integrated  platform  is  limited  by 
a  lack  of  strategies  to  enable  systemic,  ligand-directed  delivery  and  imaging  of  specific 
transgenes.  Many  eukaryotic  viruses  subserve  transgene  delivery  but  require  elimination  of 
native  tropism  for  mammalian  cells;  in  contrast,  prokaryotic  viruses  can  be  adapted  to  bind  to 
mammalian  receptors  but  are  otherwise  poor  vehicles.  We  have  introduced  a  system 
containing  cis-elements  from  adeno-associated  virus  (AAV)  and  single-stranded  bacteriophage. 
Our  AAV/phage  (AAVP)  prototype  targets  ad  integrin.  We  have  shown  that  AAVP  provides 
superior  tumor  transduction  over  phage  and  that  incorporation  of  inverted  terminal  repeats  is 
associated  with  an  improved  fate  of  the  delivered  transgene.  Moreover,  the  temporal  dynamics 
and  spatial  heterogeneity  of  gene  expression  mediated  by  targeted  AAVP  can  be  monitored  by 
positron  emission  tomography.  This  new  class  of  targeted  hybrid  viral  particles  will  enable  a 
wide  range  of  applications  in  biology  and  medicine.  Evaluation  of  this  vector  for  therapeutic 
delivery  of  TNF  to  tumor-bearing  dogs  revealed  positive  results.  A  clinical  trial  is  planned  to 
bring  this  vector  into  the  clinic. 


Figure  8:  Treatment  of  tumor-bearing  animals  with  BMTP-11  and  BMTP-78.  After  4  weeks  of  treatment,  lungs  of 
treated  and  untreated  animals  were  collect  and  analyzed  for  the  presence  of  lung  cancer  colonies.  A  significant 
reduction  in  lung  weight  and  presence  of  lung  cancer  colonies  was  observed  in  animals  treated  with  both  compounds. 


29 


Army  Award  W81XWH-05-2-0027;  Waun  Ki  Hong,  M.D. 

Annual  Report:  Reporting  Period  15  February  2008  -  14  February  2009 


o 

c 

o 


O 


Q_ 

CO 


00 


co 


Figure  9:  Lung  of  treated  animals  were  analyzed  for  the  presence  of  apoptotic  cells  after  treatment 
with  BMTP-11  and  BMTP-78.  Arrows  point  to  lung  cancer  colonies  containing  apoptotic  lung 
cancer  cells  (brown  staining). 


Phaqe-based  direct  assembly  of  gold  nanoparticles  and  NIR  applications 
The  integration  of  phage  display-based  combinatorial  tissue  targeting  and  nanotechnology  has 
emerged  from  methodology  we  have  established  for  the  direct-assembly  of  gold  (Au) 
nanoparticles  onto  phage.  Given  the  challenges  for  reproducibly  building  at  the  nanometer 
scale,  including  the  need  for  streamlined  and  “bottom-up”  approaches  for  assembling 
nanoparticle  architectures,  the  combination  of  phage  and  Au  nanoparticles  is  an  ideal  system  to 
gather  the  required  expertise  to  control  the  fabrication  and  application  of  biological  assisted 
nano-assemblies.  This  work  was  motivated  from  the  premise  that,  in  nature,  the  direct-assembly 
of  molecules  and  particles  is  often  directed  by  non-specific  hydrophobic,  van  der  Walls,  and/or 
electrostatic  interactions.  We  hypothesized  that  the  assembly  of  phage  and  Au  nanoparticles 
may  also  occur  spontaneously  through  similar  interactions.  The  outcome  from  testing  this 
hypothesis  was  the  design  and  validation  of  a  method  for  Au-phage-based  nano-assembly 
without  genetic  manipulation  or  complex  conjugation  chemistry.  We  have  generated  stable  and 
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biologically  active  networks  of  direct-assembled  Au-phage  scaffolds,  in  which  we  can  tune  the 
chemical  and  physical  properties  of  these  biological  structures.  This  tuning  capability  combined 
with  the  programmed  tissue  targeting  property  of  the  phage  has  proven  most  valuable,  as  it  has 
allowed  us  to  integrate  multiple  functions  in  a  single  nano-assembly,  serving  as  a 
complementary  and  non-mutually  exclusive  tool  among  applications,  i.e.,  near  infrared  (NIR) 
surface  enhanced  Raman  scattering  (SERS)  detection,  Au  enhanced  fluorescence  in  vitro 
imaging,  MRI  and  CT  scan  in  vivo  imaging,  and/or  heat  deposition  for  NIR  photo-therapy.  The 
identification  of  vascular  markers  targeted  by  circulating  ligands  continues  to  shed  light  on  the 
complex  cellular  and  molecular  diversity  of  the  human  vasculature.  In  the  near  future,  the 
integration  of  the  molecular  diversity  of  blood  vessels  with  nanotechnology  will  be  translated  into 
clinical  applications. 

Aim  3  To  design  tools  for  molecular  imaging  of  lung  tumors. 

Summary  of  Research  Findings 

In  this  Aim,  we  hypothesized  that  molecular  addresses  within  lung  tumors  can  be  exploited  for 
imaging  and  early  detection  of  diseased  sites;  our  goal  was  to  use  these  biochemical 
differences  to  develop  targeted  imaging  agents.  We  will  develop  novel  tools  for  intravital 
imaging  at  the  molecular  level  in  collaboration  with  the  Imaging  Core.  These  devices  will  be 
based  on  labeled  peptides  and  engineered  phage  particles  for  targeted  systemic  delivery  of 
thymidine  kinase.  The  most  efficient  system  will  be  tested  and  validated  in  vivo,  in  animal 
models,  and  ultimately  in  patients.  We  look  to  continue  these  efforts  during  the  next  project 
period. 

Key  Research  Accomplishments 

•  Tested  peptides,  selected  by  homing  to  blood  vessels  in  mouse  models,  as  carriers  to  guide 
the  delivery  of  cytotoxic  drugs,  pro-apoptotic  peptides,  metalloprotease  inhibitors,  cytokines, 
fluorophores,  and  genes. 

•  Identified  vascular  receptors  that  correspond  to  the  selected  peptides  found  in  blood  vessels 
of  normal  organs  and  tumors. 

•  Received  “safe  to  proceed”  status  by  the  FDA  on  our  BMTP-1 1 -based  IND.  Human  clinical 
trials  will  be  activated  by  March  2009. 

•  Filed  a  second  IND  with  the  FDA  to  focus  on  BMTP78. 

•  Generated  stable  and  biologically  active  networks  of  direct-assembled  Au-phage  scaffolds, 
in  which  we  can  tune  the  chemical  and  physical  properties  of  these  biological  structures. 

•  Identified  vascular  markers  targeted  by  circulating  ligands  to  further  our  knowledge  of 
complex  cellular  and  molecular  diversity  of  the  human  vasculature. 

Conclusions 


We  propose  that  the  EphA5  receptor  expressed  on  the  surface  of  lung  cancer  cells  controls 
cells  proliferation  and  survival  by  activating  the  MAP  kinase  signaling.  We  showed  that  our 
EphA5-targeted  peptides  inhibit  phosphorylation  of  ERK1/2  and  JNK.  These  effects  would 
potentially  result  in  inhibition  of  lung  cancer  cell  proliferation  and  induction  of  apoptosis.  Future 
studies  will  allow  us  to  use  shRNA  to  inhibit  the  expression  of  EphA5  in  lung  cancer  cells  and  to 
analyze  cell  proliferation  and  survival  in  vitro  and  in  vivo.  We  will  over-express  EphA5  receptor 
in  normal  cells  and  analyze  whether  the  expression  of  the  receptor  will  cause  cell  transformation 
in  vitro  and  in  vivo.  We  will  also  treat  lung  cancer  tumor-bearing  mice  with  both  systemic 
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injection  and  nasal  inhalation  of  peptide.  These  will  follow  optimization  studies  for  peptide 
stability,  the  use  of  liposome  as  carriers,  and  a  dose-escalation  experiment. 


Project  4:  Inhibition  of  bFGF  Signaling  for  Lung  Cancer  Therapy 

(PI:  Reuben  Lotan,  Ph.D.) 

The  survival  of  lung  cancer  patients  is  poor  because  this  cancer  is  diagnosed  at  advanced 
stages.  Therefore,  improvements  in  early  detection  through  the  identification  of  molecular 
markers  for  diagnosis  and  for  intervention  combined  with  targeted  chemoprevention  are  urgently 
needed.  While  the  molecular  events  involved  in  lung  cancer  pathogenesis  are  being  unraveled 
by  ongoing  large  scale  genomics,  proteomics,  and  metabolomics  studies,  it  is  already  well 
recognized  that  proliferation-,  survival-  and  angiogenesis-  promoting  signaling  pathways  are 
amplified  in  lung  cancer.  Among  the  angiogenesis  signaling  pathways,  the  basic  fibroblast 
growth  factor  (bFGF)  and  its  transmembrane  tyrosine  kinase  receptors  (FGFRs)  are  playing 
important  roles  in  addition  to  the  well-studied  vascular  endothelial  growth  factor  (VEGF)  and  its 
receptors  (VEGFRs).  Both  types  of  angiogenesis  signaling  pathways,  the  VEGFA/EGFR  and 
the  bFGF/FGFR,  have  been  detected  in  NSCLC  and  associated  with  lung  cancer  development. 
However,  most  efforts  in  preclinical  and  clinical  trials  have  been  directed  to  the  VEGF/VEGFR 
pathway. 

We  hypothesize  that  bFGF  triggers  signaling  pathways  that  contribute  to  malignant  progression 
of  lung  cancers  by  stimulating  tumor  cell  and  endothelial  cell  proliferation  and  survival  and 
augmenting  angiogenesis.  Therefore,  agents  that  intervene  in  this  pathway  may  be  useful  for 
lung  cancer  therapy  either  alone  or  in  combination  with  agents  that  target  the  VEGF/VEGFR 
signaling  pathways  and/or  with  cytotoxic  agents.  We  will  address  the  following  specific  aims  in 
order  to  understand  the  mechanism(s)  underlying  the  in  vitro  and  in  vivo  effects  of  bFGF  on  lung 
cancer  and  endothelial  cells  and  the  ability  of  bFGF  inhibitors  to  suppress  the  growth  of  NSCLC  in 
vitro  and  in  vivo. 


Aim  1  Determine  the  effects  of  bFGF  on  in  vitro  growth,  survival,  motility,  invasion 
and  angiogenesis  of  NSCLC  cells  and  endothelial  cells. 

Summary  of  Research  Findings 

This  aim  was  completed  and  summarized  in  the  previous  reports. 

Aim  2  Evaluate  the  relative  potency  of  several  inhibitors  of  bFGF  binding  to  receptor 
(i.e.,  TMPP  and  analogs)  in  inhibiting  effects  of  bFGF  detected  in  Specific  Aim 
1  and  evaluate  the  effects  of  these  inhibitors  in  combination  with  paclitaxel  on 
in  vitro  growth  and  survival  of  tumor  cells. 

Summary  of  Research  Findings 

Agents  tested  during  previous  reporting  periods  for  inhibition  of  FGFR  signaling  were  not 
available  in  sufficient  amounts  needed  for  use  in  in  vivo  experiments  (as  detailed  in  the  update 
for  specific  aim  3  below),  so  we  tested  the  potency  of  additional  agents  that  target  FGF  receptor 
signaling.  The  first  agent  was  5-Deoxy-5-(methylthio)adenosine  (MTA),  a  naturally  occurring 
sulfur-containing  nucleoside  present  in  all  mammalian  tissues.  We  have  chosen  to  test  MTA 
because  it  has  been  shown  to  specifically  inhibit  FGF-2-induced  fibroblast  proliferation  through 
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binding  to  the  FGF  receptor  and  inhibiting  FGF-2-induced  FGFR  tyrosine  kinase  activity  and 
autophosphorylation  (Fig.  1).  Another  reason  for  our  interest  in  this  agent  was  the  report  that 
MTA  can  inhibit  the  accumulation  of  the  high  molecular  weight  species  of  bFGF  (M.  wt.  of  24, 
22.5,  and  22  kDa)  in  the  cell  nucleus  by  suppressing  methylation-dependent  post-translational 
modification  (Fig.  2).  We  found  that  MTA  inhibited  the  growth  of  the  lung  tumorigenic  cell  line 
1170-1  in  a  dose-dependent  fashion  with  50%  inhibition  that  was  achieved  with  about  100  pM 
MTA  (Fig.  1  A).  The  inhibition  was  due  to  suppression  of  the  mitogenic  effects  of  bFGF  by  MTA 
as  indicated  by  inhibition  of  the  increases  S  phase  (DNA  synthesis)  caused  by  bFGF  alone 
compared  to  bFGF  plus  MTA  (Fig.  IB). 


1170-1  cells  were  treated  with  MTA  for  56h 


Figure  1:  Inhibition  of  the  growth  of  1 170-1  cells  by  MTA.  (A)  Cells  were  treated  with  the  indicated  doses  of  MTA  for  72  hours 
and  then  cell  numbers  were  assessed  by  the  sulforhodamine  B  assay  in  triplicates.  (B)  Cell  cycle  analysis  was  performed  after  56 
hours  of  cell  growth  with  no  additives  (control)  or  only  bFGF,  or  MTA  plus  bFGF.  The  cells  were  then  stained  with  propidium 
iodide  and  processed  for  cell  cycle  analysis  by  flow  cytometry. 


The  second  agent,  Sanofi-Aventis  compound  SSR128129E,  was  tested  at  concentrations  of  up 
to  200  nM  for  1  to  3  days,  with  and  without  30  ng/ml  of  bFGF  using  the  following  lung  tumor  cell 
lines:  1170-1,  H441, 2279,  PC14PE6,  1799  and  BEAS-2B.  We  found  no  inhibition  of  growth  with 
any  of  these  cell  lines;  we  aborted  the  plans  to  use  this  agent.  Later,  we  were  informed  that 
Sanofi-Aventis  had  canceled  their  own  plans  to  develop  this  compound. 

A  third  strategy  was  an  alternative  to  using  small  molecular  weight  inhibitors.  We  previously 
concluded  that  infection  of  lung  cancer  cells  with  adenoviral  vectors  that  harbor  a  dominant 
negative  form  of  FGF  Receptor  (DNFGFR1)  inhibited  cell  growth.  We  have  expanded  this 
approach  and  found  that  the  expression  of  DNFGFR1  inhibited  the  growth  of  several  lung 
cancer  cell  lines  as  a  function  of  the  amount  of  the  virus  used  for  infection  expressed  in  plaque¬ 
forming  units  (PFU)  per  cell  (Figs.  2  and  3).  The  results  show  that  expression  of  DNFGFR1 
inhibits  the  growth  of  all  the  cell  lines  tested  and  induces  cell  death  as  indicated  by  cell 
detachment  (Fig.  2D),  DNA  fragmentation  detected  by  the  tunnel  assay  of  apoptosis  (Fig.  2E), 
and  cleavage  of  PARP  (Fig.  2E).  Colony  formation  in  agarose  was  also  suppressed  by  76% 
after  treating  the  cells  with  the  DNFGFR1  viral  vector.  The  expected  suppression  of  FGFR1 
kinase  activity  was  confirmed  by  immunoprecipitation/Western  blotting  analysis  (Fig.  2D). 
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Figure  2.  (A)  1170-1  tumorigenic  lung  cells  were  infected  with  adenoviral  vector  harboring  control  construct-  Green 
fluorescent  protein  (GFP)  at  10,000  PFU/cell)  or  virus  harboring  DNFGFR1  at  1000,  5000,  or  10000  PFU/cell.  The  cell 
number  was  determined  after  24,  48  and  72  hours.  (B)  The  appearance  of  the  cells  treated  as  in  A  under  the  microscope. 
Note  the  rounded  floating  dead  cells  in  culture  treated  with  DNFGFR1  virus  for  72  hr.  (C)  The  1170-1  cells  were  treated 
with  lOOOOPFU/ml  virus  with  GFP  or  DNFGFR1  and  then  assayed  for  colony  formation  in  agarose.  Colony  numbers 
were  scored  after  6  days  under  the  microscope.  Average  of  4  fields  is  presented  ±SE.  (D)  1 170-1  cells  were  infected  with 
adenoviral  vector  containing  GFP  or  DNFGFR1  and  treated  with  10  ng/ml  bFGF  for  30  min  before  cell  lysis  and 
immunoprecipitation  with  anti-FGFRl  antibodies.  The  immunoprecipitates  were  subjected  to  slab  gel  electrophoresis  and 
immunoblotting  with  anti-phosphotyrosine  antibodies.  (E)  Cells  were  analyzed  by  the  TUNEL  assay  and  PARP  cleavage 
for  apoptotic  cell  death. 


Studies  on  the  mechanism  by  which  DNFGFR1  acts  to  inhibit  the  growth  and  induce  apoptosis 
in  lung  tumor  cells  are  ongoing.  Preliminary  results  indicate  that  DNFGFR1  increases  the  cell 
cycle  inhibitors  p21  and  p27  and  decreases  the  cell  cycle  regulators  cdc25  and  Chkl. 
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Figure  3:  Lung  cancer  cell  lines  A549,  HI 299,  and  HI 792  were  infected  with  control  virus  (GFP) 
or  virus  containing  DBFGFR1  at  500,  2000,  or  5000  PHU/cell)  and  the  number  of  cells  was 
determined  after  24,  48,  and  72  hours. 


Aim  3  Evaluate  anti-tumor  activity  (growth  inhibition,  apoptosis,  suppression  of 
angiogenesis)  of  the  most  effective  inhibitor  identified  in  Specific  Aim  2  when 
used  alone  and  in  combination  with  paclitaxel  in  an  orthotopic  lung  cancer 
model  using  luciferase-expressing  NSCLC  cells  for  in  vivo  bioluminescence 
imaging  of  tumor  growth  and  response  to  treatment. 

Summary  of  Research  Findings 

The  most  appropriate  way  to  pursue  our  in  vivo  studies,  as  a  prelude  to  potential  clinical  trials,  is 
to  secure  a  large  source  of  the  drug  from  our  pharmaceutical  counterparts.  Therefore,  from  the 
beginning  of  this  project,  we  tried  to  identify  agents  that  target  FGF  receptor  signaling  and  are 
under  development  in  pharmaceutical  companies.  We  had  identified  three  companies  and 
tested  two  of  the  compounds  that  they  had  agreed  to  provide.  Initially,  we  examined  the 
compound  5,10,15,20-tetrakis(methyl-4-pyridyl)-21FI,23FI-porphine-tetra-p-tosylate  salt  (TMPP), 
which  showed  inhibitory  effects  against  lung  tumor  cells,  as  reported  previously.  However,  the 
company  (Prochon,  Rehovot,  Israel),  which  was  the  source  of  this  agent,  decided  not  to  pursue 
the  development  of  this  compound.  We  contacted  Sanofi-Aventis  after  learning  that  they  had  an 
FGFR  inhibitor  (SSR128129E).  We  found  that  this  compound  was  ineffective  in  any  of  our  cell 
lines  at  concentrations  that  were  relevant  for  in  vivo  studies.  Therefore,  we  stopped  further 
experiments  with  this  agent,  and,  as  noted,  Sanofi-Aventis  has  also  stopped  development  of  this 
agent  due  to  toxicity. 

We  contacted  Bristol-Mayer-Squibb,  who  had  a  dual  vascular  growth  factor  receptor  (VEGFR)  / 
Fibroblast  growth  factor  receptor  (FGFR)  inhibitor  in  development,  but  we  were  not  able  to  get 
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this  material  for  in  vitro  testing.  Therefore,  we  were  not  able  to  carry  out  in  vivo  experiments 
with  these  small  molecules.  As  an  alternative,  we  examined  two  potential  approaches  that  were 
described  above  under  specific  aim  3,  namely  the  FGFR  inhibitor  5’-(methylthio)adenosine  and 
the  adenoviral  vector  expressing  dominant  negative  FGF  receptor. 

Aim  4  To  investigate  the  expression  of  bFGF  signaling  components  (bFGF,  FGFR-1, 
FGFR-2,  heparan  sulfate,  syndecan-1,  and  FGFR-3)  by  IHC  staining  of  tissue 
microarrays  (TMAs),  and  correlate  the  expression  of  bFGF/bFGFRs  between 
tumor  and  non-malignant  epithelial  cells  with  angiogenesis. 

Summary  of  Research  Findings 

During  the  last  year,  we  completed  the  analysis  of  the  expression  patterns  of  basic  fibroblast 
growth  factor  (bFGF)  and  FGF  receptors  1,  in  NSCLC  tumors,  and  premalignant  lesions 
including  squamous  metaplasia  and  dysplasia.  The  publication  of  these  results  required  some 
additional  experiments  (Behrens  et  al,  Clinical  Cancer  Research  2008);  final  results  have 
demonstrated  high  expression  of  bFGF,  FGFR1,  and  FGFR2  in  most  NSCLC  tumors.  The 
pattern  of  expression  for  all  markers  varied  according  to  tumor  histologic  type  and  cellular 
localization.  Cytoplasmic  expression  scores  were  significantly  higher  in  tumors  than  in  normal 
epithelia.  Nuclear  bFGF  and  FGFR1  levels  were  significantly  higher  in  women  than  in  men. 
Although  cytoplasmic  FGFR1  expression  was  significantly  higher  in  smokers  than  in  non 
smokers,  nuclear  FGFR1  and  FGFR2  expression  was  significantly  higher  in  non  smokers. 
Dysplastic  changes  showed  significantly  higher  expression  of  all  markers  compared  with 
squamous  metaplasia. 

Key  Research  Accomplishments 

•  Suppressed  the  growth  of  lung  tumor  cells  by  repressing  FGFR  activation  with  the  FGF 
receptor  inhibitor  MTA. 

•  Demonstrated  that  dominant  negative  FGF  receptor  adenoviral  expression  vector  inhibits 
the  growth  of  lung  cancer  cell  lines  by  inducing  apoptosis. 

•  Concluded  that  bFGF,  FGFR1 ,  and  FGFR2  are  highly  expressed  in  most  NSCLC  tumors. 

Conclusions 


During  the  period  of  performance,  we  have  shown  that  MTA,  as  an  effective  FGFR  inhibitor,  is  a 
potent  cell  growth  suppressor  and  apoptosis  inducer,  and  should  be  further  evaluated  in 
preclinical  setting  for  anti  tumor  effects.  Our  research  data  demonstrated  that  MTA  inhibited  the 
growth  of  the  lung  tumorigenic  cell  line  1170-1  in  a  dose-dependent  fashion  with  50%  inhibition 
that  was  achieved  in  our  lab.  These  findings  led  us  to  study  adenoviral  vectors  harboring 
dominant  negative  FGFR1  as  potential  candidates  for  preclinical  in  vivo  studies  with  aerosolized 
viral  formulation.  Studies  on  the  mechanism  by  which  DNFGFR1  acts  to  inhibit  the  growth  and 
induce  apoptosis  in  lung  tumor  cells  are  ongoing.  Our  preliminary  results  indicate  that 
DNFGFR1  increases  the  cell  cycle  inhibitors  p21  and  p27  and  decreases  the  cell  cycle 
regulators  cdc25  and  Chkl .  We  look  to  complete  these  studies  during  the  next  project  period  to 
confirm  that  the  bFGF  signaling  pathway  activation  may  be  an  early  indicator  of  SCC 
pathogenesis  and  a  novel  target  for  lung  cancer  chemopreventive  and  therapeutic  strategies. 
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Project  5:  Targeting  mTOR  and  Ras  signaling  pathways  for  lung  cancer  therapy 

(Project  Co-leaders:  Fadlo  R.  Khuri,  M.D.,  Shi-Yong  Sun,  Ph.D.) 


Aim  1  To  determine  whether  an  mTOR  inhibitor  inhibits  the  growth  of  human  NSCLC 
cells  via  G1  growth  arrest  or  induction  of  apoptosis,  and  to  identify  the 
molecular  determinants  of  mTOR  inhibitor  sensitivity. 

This  aim  was  completed  and  summarized  in  the  previous  reports. 

Aim  2  To  determine  whether  the  effect  of  mTOR  inhibitors  on  the  growth  of  human 
NSCLC  cells  is  enhanced  in  the  presence  of  a  PI3K  inhibitor  or  a  MAPK 
inhibitor. 

This  aim  was  completed  and  summarized  in  the  previous  reports. 

Aim  3  To  evaluate  the  efficacies  of  the  combinations  of  rapamycin  with  LY294002  or 
U0126  in  nude  mice  models  of  lung  cancer  xenografts  in  vivo. 

Summary  of  Research  Findings 

Results  from  the  previous  reporting  period  showed  that  mTOR  inhibitors  increase  elF4E 
phosphorylation  in  cultured  lung  cancer  cells.  Moreover,  we  determined  whether  mTOR 
inhibitors  increased  p-elF4E  levels  in  vivo.  We  further  detected  p-elF4E  levels  in  lung 
xenorgafts  in  mice  exposed  to  RAD001  for  14  days  using  both  Western  blotting  and 
immunohistochemistry  (IHC).  As  presented  in  Figure  1A,  p-elF4E  levels  in  both  A549  and  H460 
xenografts  treated  with  RAD001  detected  by  the  Western  blotting  were  significantly  higher  than 
those  treated  with  vehicle  control  (p  <0.001).  By  IHC,  we  found  that  five  out  of  six  A549 
xenograft  tumors  did  not  or  expressed  very  low  levels  of  p-elF4E.  However,  six  tumors  exposed 
to  RAD001  for  14  days  exhibited  increased  levels  of  p-elF4E  (with  various  degrees)  that  were 
significantly  higher  than  vehicle  control  tumors  (Fig.  IB;  P  <  0.01).  Collectively,  these  results 
indicate  that  RAD001  increases  elF4E  phosphorylation  in  vivo  in  lung  xenografts. 
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Figure  1:  RAD001  increases  p-eIF4E 
levels  in  lung  cancer  xenografts 
evaluated  with  either  Western  Blot 
analysis  (A)  or  IHC  (£).  A549  cells  at  5 
x  1 06  in  serum- free  medium  were 
injected  s.c.  into  the  flank  region  of 
nude  mice.  When  tumors  reached 
certain  size  ranges  (-100  mm3),  the 
mice  were  randomized  into  two  groups 
(n  =  6/group)  for  the  following 
treatments:  vehicle  control  and 
formulated  RAD001  (4  mg/kg/day,  oral 
gavage).  After  14  days,  the  mice  were 
sacrificed  and  the  tumors  were 
removed,  frozen  (A)  or  fixed  (B).  A,  p- 
eIF4E  was  evaluated  with  Western  blot 
analysis  (upper  panels)  and  the  data 
was  scanned  and  quantitated  using  NHI 
Image  J  software  (lower  panels).  B ,  p- 
eIF4E  levels  were  detected  using  IHC 
and  scored  as  a  weight  index 
(percentage  x  intensity)  (left  panel). 
Representative  pictures  of  p-eIF4E 
expression  (100X)  were  presented 
(right  panels).  The  difference  between 
vehicle  and  RAD001  treatments  was 
analyzed  by  unpaired  student  t  test  with 
or  without  Welch  correction. 


In  addition  to  induction  of  Akt  phosphorylation  by  mTOR  inhibitors,  we  found  that  mTOR 
inhibition  also  induced  activations  of  the  MEK/ERK  signaling  pathway  in  some  cancer  cell  lines 
after  a  prolonged  treatment.  The  combination  of  rapamycin  with  the  MEK  inhibitor  U0126 
significantly  enhanced  growth  inhibitory  effects  of  cancer  cells,  suggesting  that  MEK/ERK 
activation  may  counteract  mTOR  inhibitors’  anticancer  efficacy.  Similarly,  the  combination  of  an 
mTOR  inhibitor  with  the  EGF  receptor  inhibitor  erlotinib  synergistically  inhibited  the  growth  of 
both  human  cancer  cells  in  cell  cultures  and  xenografts  in  nude  mice  (Fig.  2).  Moreover,  the 
presence  of  erlotinib  suppressed  rapamycin-induced  phosphorylation  of  Akt,  ERK  and  elF4E  as 
well,  implying  that  erlotinib  can  suppress  mTOR  inhibition-induced  feedback  activation  of 
several  survival  signaling  pathways  including  Akt,  ERK  and  elF4E.  Thus,  we  suggest  a 
therapeutic  strategy  for  enhancing  mTOR-targeted  cancer  therapy  by  preventing  mTOR 
inhibition-induced  feedback  activation  of  several  survival  mechanisms.  These  reportable 
outcomes  were  published  in  Cancer  Biology  and  Therapy  (Wang  et  al.  2008). 
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Figure  2:  The  combination  of  mTOR  inhibition  and 
EGFR  inhibition  exhibits  enhanced  anticancer 
efficacy  in  vivo.  Four  groups  of  mice  with  either  A549 
xenografts  were  treated  with  vehicle  control,  RAD001 
(3  mg/kg/day)  alone,  erlotinib  (80  mg/kg/day)  alone, 
and  RAD001  plus  erlotinib  on  the  same  day  after 
grouping.  After  14  days,  the  mice  were  sacrificed.  The 
tumors  were  then  removed  and  weighed  ( B ).  During 
the  treatments,  tumor  sizes  were  measured  once  every 
two  days  (A).  Tumor  sizes  treated  with  erlotinib  alone 
and  RAD001  plus  erlotinib  were  significantly  smaller 
than  vehicle  control  treatment  at  all  measurement 
times  (p  <  0.01).  Tumor  sizes  treated  with  RAD001 
alone  were  significantly  smaller  than  vehicle  control 
treatment  at  days  10,  12,  and  14  (p  <  0.01).  #p<  0.05 
compared  to  either  RAD00 1  or  erlotinib  alone 
treatment.  The  data  are  means  ±  SE  of  6  tumors  from 
6  mice. 


As  noted  above,  the  mTOR  inhibitor  RAD001  increase  p-elF4E  levels  in  vivo  (Fig.  1).  In 
xenografts  treated  with  the  combination  of  RAD001  and  erlotinib,  elF4E  phosphorylation  was 
not  increased  compared  with  RAD001 -treated  tumors.  Accordingly,  the  combination  of  RAD001 
and  erlotinib  was  more  potent  than  each  agent  alone  in  inhibiting  the  growth  of  lung  cancer 
xenografts.  Additionally,  we  noted  that  the  expression  of  Mcl-1,  which  is  regulated  by  cap- 
dependent  translation  mechanism  and  important  for  elF4E-mediated  oncogenesis,  was  slightly 
higher  in  RAD001 -treated  tumors,  but  significantly  inhibited  in  lung  tumors  treated  with  the 
combination  of  RAD001  and  erlotinib.  We  suggested  that  erlotinib  enhances  RADOOTs 
anticancer  activity  in  vivo  involving  abrogation  of  mTOR  inhibition-induced  elF4E 
phosphorylation.  Modulation  of  elF4E  phosphorylation  may  serve  as  a  predictive  biomarker  for 
RADOOI/erlotinib-based  cancer  therapy.  This  part  of  the  data  will  be  presented  in  the  upcoming 
2009  AACR  meeting.  The  in  vivo  results  on  the  combination  of  RAD001  and  LY294002  in  lung 
cancer  xenografts,  which  were  summarized  in  the  last  year’s  report,  have  been  published  in 
Cancer  Res  (Wang  et  al.  2008). 

Aim  4  To  conduct  a  pilot  clinical  biochemical  induction  trial  to  investigate  the  effect  of 
RAD001  in  operable  NSCLC  patients  and  identify  molecular  determinants  of 
RAD001  sensitivity  and  prognosis. 

Summary  of  Research  Findings 

The  Phase  lb  protocol  was  activated  effective  February  2007.  After  screening  7  eligible 
patients,  with  6  of  7  declining  due  to  the  randomization  to  placebo,  we  successfully  amended 
the  study  to  remove  the  placebo  arm.  Approval  was  obtained  from  the  FDA,  Department  of 
Defense  Human  Protection  Committees  as  well  as  Emory’s  IRB,  and  we  have  now  accrued  8 
total  patients  on  this  trial.  One  of  these  8  patients,  3  patients  reported  drug  discontinuation  due 
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to  grade  3  nausea  and  vomiting  after  8  days.  All  other  patients  have  completed  the  scheduled  3 
weeks  of  therapy  and  have  undergone  surgical  resection.  Overall,  there  has  been  a  decrease  in 
the  FDG  uptake  on  the  PET  scan  in  7  out  of  8  patients  enrolled  to  the  study.  The  quantitative 
reduction  in  the  standardized  uptake  value  (SUV)  ranged  from  20%  to  50%  overall;  one  patient 
experienced  a  slight  reduction  in  tumor  size  with  RAD001.  Since  the  last  2  patients  enrolled  to 
the  study  have  developed  interstitial  pneumonitis,  we  have  temporarily  halted  accrual  (effective 
February  13,  2009)  until  further  information  can  be  obtained  about  these  events.  The  adverse 
events  appear  unlikely  related  to  the  study  drug  and  could  be  a  function  of  the  patient’s  pre¬ 
operative  pulmonary  functions.  We  are  planning  to  analyze  the  pre-  and  post-treatment  tumor 
specimens  for  molecular  markers  in  the  next  few  weeks. 

Key  Research  Accomplishments 

•  Indicated  that  RAD001  increases  elF4E  phosphorylation  in  vivo  in  lung  xenografts. 

•  Demonstrated  that  mTOR  inhibition  induced  activations  of  the  MEK/ERK  signaling  pathway 
in  some  cancer  cell  lines  after  a  prolonged  treatment. 

•  Observed  co-targeting  of  mTOR  and  PI3K/Akt  signaling  exerts  enhanced  anticancer  activity 
in  lung  cancer  xenograft  models.  mTOR  inhibitors  increase  elF4E  phosphorylation  in  lung 
cancer  xenografts. 

•  Modulation  of  elF4E  phosphorylation  may  serve  as  a  predictive  biomarker  for 
RADOOI/erlotinib-based  cancer  therapy. 

•  Accrued  8  patients  on  the  clinical  trial. 

Conclusions 


Targeting  the  mTOR  axis  appears  to  be  a  promising  strategy  against  lung  cancer.  Given  the 
nature  of  the  complexity  of  lung  cancer  signaling  pathways,  including  mTOR  signaling,  it  is 
essential  to  understand  the  biology  of  lung  cancer  and  the  mechanism  of  action  for  the 
therapeutics  of  interest  in  order  to  efficiently  treat  lung  cancer  through  application  of 
mechanism-driven  therapeutic  regimens.  Thus,  we  have  demonstrated  the  scientific  rationale 
for  our  effort  in  pursuing  mTOR-targeted  lung  cancer  therapy. 


Project  6:  Identification  and  Evaluation  of  Molecular  Markers  in  Non-Small  Cell  Lung 
Cancer  (NSCLC) 

(PI  and  co-PI:  Ralf  Krahe,  Ph.D.,  Li  Mao,  M.D) 

A  better  understanding  of  the  lung  cancer  biology  and  an  identification  of  genes  involved  in 
tumor  initiation,  progression  and  metastasis  are  an  important  first  step  leading  to  the 
development  of  new  prognostic  markers  and  targets  for  therapy.  In  the  same  context, 
identification  of  reliable  predictive  markers  for  response  or  resistance  to  therapy  in  NSCLC 
patients  is  also  desperately  desired  for  optimal  delivery  of  targeted  therapy  and/or  standard 
chemotherapy.  The  proposed  studies  aim  to  identify  the  two  types  of  markers  that  would 
eventually  help  develop  smarter  clinical  trials,  which  will  selectively  recruit  patients  who  are 
more  likely  to  respond  to  one  regimen  over  another  and  lead  to  improvement  of  overall 
therapeutic  outcomes. 
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Aim  1  To  expression  profile  by  DNA  microarray  technology  aerodigestive  cancers  - 
with  primary  focus  on  adenocarcinoma  and  squamous  cell  carcinoma  (SCC) 
of  the  lung,  and  head  and  neck  squamous  cell  carcinoma  (HNSCC), 
including  primary  tumors  and  normal  adjacent  tissue,  and  (where  available) 
metastatic  lesions. 

Summary  of  Research  Findings 

During  the  current  project  period,  we  completed  the  expression  profiling  on  8  matched 
tumor/normal  adjacent  tissue  (T/N)  pairs  of  lung  cancers,  5  adenocarcinomas,  and  3  squamous 
cell  carcinomas  (SCC)  from  an  initial  total  of  22  NSCLC  T/N  pairs.  However,  due  to  the  limited 
number  of  samples  and  limited  expression  profiling  obtained,  statistically  meaningful 
conclusions  for  the  lung  cancer  part  of  the  study  and,  therefore,  meaningful  comparisons  with 
the  HNSCC  part  of  the  study  were  not  possible.  We  therefore  decided  to  focus  our  efforts  on 
Aims  2  and  3. 


Aim  2  To  DNA  profile  the  same  samples  by  complementing  DNA  approaches  to 
stratify  RNA  expression  profiles  on  the  basis  of  their  corresponding  DNA 
profiles. 

Summary  of  Research  Findings 

Originally,  we  proposed  to  correlate  DNA  and  RNA  profiles  from  the  21  NSCLCs  gathered  in 
Aim  1.  Due  to  the  issues  related  to  sample  procurement,  it  was  not  feasible  to  acquire  the 
necessary  number  of  matching  RNA/DNA  samples  that  would  result  in  a  significant  statistical 
power.  We  decided  to  gather  DNA  samples  without  matching  RNA  samples  from  108  NSCLC 
patients  with  complete  clinical  and  follow-up  data  (56  adenocarcinomas,  48  SCC,  and  25  other 
lung  cancers)  for  a  total  of  129  primary  tumor  (PT)  and  45  matching  normal  adjacent  tissue 
(NAT/PT)  pairs,  which  we  used  for  genomic  profiling. 

We  performed  genomic  profiling  of  the  same  HNSCC  samples  on  which  we  performed 
transcriptomic  profiling  (see  above  Aim  1)  using  the  Affymetrix  10K  Xba\  array.  Using  24 
genetically  matched  NAT  and  PT  pairs  and  12  matching  metastatic  lymph  nodes  (MLN),  we 
genotyped  a  total  of  8,430  tag  SNPs.  Analysis  of  the  genotype  data  with  GeneSpring  GTv2 
(Agilent)  for  allelic  imbalance  (Al),  indicative  of  loss-of-heterozygosity  (LOH),  showed  that  the 
majority  of  Al  events  occurred  from  normal  to  primary  tumor,  while  the  transition  from  primary 
tumor  to  metastasis  was  marked  by  few  little  additional  events  (0-2%).  In  the  NAT/PT 
comparison,  we  identified  five  regions  with  significant  Al:  9p24.1-p23  (38%),  5q33.3  (25%), 
3q25.32-q26.1,  13q14  and  17p13  (all  21%),  and  3p23-p22.3  (13%).  Our  overall  objective  was  to 
investigate  regions  of  frequent  Al  in  HNSCCs  that  are  shared  with  NSCLC  to  identify  candidate 
tumor  suppressor  genes  as  well  as  to  assess  whether  additional  “hits”  at  these  loci  were  present 
or  likely. 

Due  to  limited  resolution,  we  hypothesized  that  the  10K  SNP  array  probably  underestimates 
LOH,  resulting  in  missed  LOH  over  short  genomic  segments  and  poorly  defined  regional 
boundaries.  To  identify  additional  tumors  with  smaller  losses  and  to  pinpoint  the  most  frequently 
lost  genes  within  the  six  most  frequently  lost  regions  (9p,  3p,  3q,  5q,  1 1  q,  and  17p),  we 
performed  LOH  analysis  in  27  N/P  and  14  N/M  pairs  using  a  total  of  35  microsatellite  (MS) 
markers.  In  all  cases,  the  MS  LOH  data  confirmed  the  LOH  identified  by  the  SNP  array.  For  all 
frequently  lost  regions,  the  MS  data  identified  loss  in  additional  tumors  that  were  not  detected 
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on  the  SNP  array.  This  was  particularly  striking  for  three  regions,  namely  3p26.1-p25.2,  1 7p1 3 
and  1 1q23.2-q23.3.  Although  SNP  LOH  at  3p26  and  1 1  q23  was  most  frequently  observed  in 
the  NAT/MLN  comparison,  MS  revealed  that  LOH  was  much  more  frequent  in  N/P.  The  most 
frequently  lost  genes  are  shown  in  the  table  below;  all  are  genes  previously  reported  to  be 
associated  with  tumorigenesis. _ 


Cytoband 

Tumor 

Suppressor 

Candidate 

Comment 

3p26.1-25.2 

VHL 

Involved  in  tumor  angiogenesis 

3q25.31 

RARRES1 

Down-regulated  by  methylation  in  prostate  cancer, 

5q33.3 

CCNJL 

Cyclin  J-like,  cell  cycle  regulation 

9p24.1-p23 

PTPRD 

Regulates  cell  growth  and  transformation 

1 7p1 3 

TP53 

Classic  tumor  suppressor,  cell  cycle,  repair 

1 1q23.2 

TAGLN 

Transgelin,  early  marker  for  transformation 

Table  1:  Frequently  lost  gene  regions  in  SNP  arrays 


The  9p24.1-p23  region  was  identified  in  the  SNP  analysis  as  the  most  frequently  lost  region  in 
this  sample  set.  This  region  contained  only  a  single  gene,  PTPRD,  and  we  focused  on  this  gene 
as  a  candidate  tumor  suppressor.  We  compared  LOH  at  PTPRD  with  LOH  at  TP53  and 
CDKN2A,  both  genes  reported  as  being  frequently  lost  early  in  tumorigenesis.  We  found  that 
LOH  at  PTPRD  in  both  primary  tumors  and  metastasis  (17/26  P=65%,  9/13  M=69%)  was  as 
frequent  as  LOH  at  TP53  in  primary  tumors  and  even  more  frequent  in  metastases  (16/26 
N/P=62%,  4/13  N/M=31%). 

To  address  the  question  of  whether  PTPRD  was  also  frequently  lost  in  other  tumor  types,  we 
compared  microsatellite  LOH  at  both  PTPRD  and  TP53  in  HNSCC  to  a  set  of  matched  TIN 
samples  from  lung  and  colorectal  (CRC)  cancers.  LOH  at  PTPRD  in  CRC  (3/9=33%)  was 
similar  to  LOH  at  TP53  (4/10=40%).  LOH  at  PTPRD  in  lung  cancer  (8/20=40%)  was  even  more 
frequent  than  at  TP53  (5/20=25%).  Thus  LOH  at  PTPRD,  while  not  quite  as  frequent  in  lung 
cancer  and  CRC  as  in  HNSCC,  was  nevertheless  frequently  seen.  Therefore,  PTPRD  may  be  a 
non-specific  marker  of  early  tumorigenesis  in  many  epithelial  tumors. 

As  both  PTPRD  and  RARRES1  have  CpG  islands  in  their  5’  promoter  regions  that  may  be 
subject  to  epigenetic  inactivation  by  promoter  hypermethylation,  we  investigated 
hypermethylation  as  a  “hit”  that  is  functionally  equivalent  to  loss  (reported  in  Aim  3). 

The  genes  TCF21  and  CHD5  had  been  identified  in  other  studies  as  possible  early  markers  of 
tumorigenesis.  We  also  investigated  LOH  at  these  two  genes  in  HNSCC  and  lung  cancer 
samples  using  MS  markers.  At  CHD5,  we  found  little  LOH  in  HNSCC  samples  (12%  in  PT  and 
10%  in  MLN)  and  only  slightly  more  in  lung  cancer  (33%  of  22  PT).  At  TCF21,  LOH  in  HNSCC 
was  striking  (59%  of  PT  and  50%  of  MLN);  lung  cancers  have  somewhat  less  LOH  at  this  locus 
(43%  of  37  PT). 
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Aim  3  To  evaluate  the  contribution  of  promoter  hypermethylation  and  transcriptional 
inactivation  of  known  cancer  genes  subject  to  epigenetic  silencing  to  cancer 
phenotype. 

Summary  of  Research  Findings 

Last  year,  we  reported  that  the  transcription  factor  TCF21  was  the  most  often  hypermethylated 
(>30%  methylation)  gene  in  both  HNSCC  (96%,  n=24)  and  NSCLC  (95%,  n=21).  This  finding 
was  confirmed  when  we  investigated  a  second  larger  validation  sample  set  of  108  NSCLC 
patients  with  complete  clinical  and  follow-up  data  (56  adenocarcinomas,  48  SCC,  and  25  other 
lung  cancers)  for  a  total  of  129  PT  and  45  matching  NAT/PT  pairs.  Overall,  73%  of  lung  cancers 
showed  TCF21  hypermethylation  (n=129).  Adenocarcinomas  and  SCCs  together  showed 
hypermethylation  in  78%  (n=100)  of  tumors,  while  other  lung  cancers  showed  hypermethylation 
in  64%  of  cases  (n=25).  Adenocarcinomas  compared  to  SCCs  showed  slightly  higher  levels  of 
(40%±2.5  vs.  37%±2.7)  and  more  frequent  (90%  vs.  70%)  methylation,  respectively. 

To  determine  the  effect  of  TCF21  promoter  hypermethylation  on  gene  expression,  we 
collaborated  with  Dr.  Ignacio  Wistuba  (Pathology  Core)  to  perform  TCF21  protein 
immunohistochemical  (IHC)  analysis  on  a  TMA  of  300  lung  cancer  cases.  Similar  to  our  DNA- 
based  methylation  data,  253  of  300  cases  [85%;  166/191  (87%)  adenocarcinoma  and  87/109 
(80%)  squamous  cell  carcinoma]  resulted  in  no  or  significantly  decreased  TCF21  expression. 
Correlation  of  methylation  status  with  available  clinical  factors  (Biostatistics  Core)  showed  a 
significant  correlation  with  histology  (adenocarcinoma,  p  =  0.0026),  sex  (females,  p  =  0.0209), 
and  smoking  status  (never  smokers,  p  =  0.0476).  However,  race,  TNM  stage,  and  prognosis 
were  not  significantly  associated.  Given  the  interesting  association  of  histology,  sex,  and 
smoking-status  (adenocarcinoma  in  female  never-smokers),  which  is  reminiscent  of  recent 
findings  for  EGFR  mutations  in  lung  cancer  patients,  we  investigated  the  possibility  that  EGFR 
mutation  status  might  predict  TCF21  expression.  Multivariate  analysis  showed  that  histology, 
but  not  gender  or  smoking  status,  was  associated  with  low  TCF21  expression.  However,  when 
adenocarcinoma  cases  (n=172)  were  considered  alone  by  both  univariate  and  multivariate 
analysis,  no  significant  correlation  was  observed. 

A  high  percentage  (85%)  of  the  lung  cancers  were  hypermethylated  at  TCF21.  Approximately 
35%  of  samples  showed  both  LOH  and  hypermethylation.  We  sequenced  the  remaining  tumors 
to  determine  whether  mutation  might  furnish  a  second  hit,  but  results  showed  no  TCF21  coding 
mutations  identified.  Based  on  our  results,  the  primary  mode  of  inactivation  of  TCF2'\  would 
appear  to  be  methylation  (>85%)  followed  by  deletion  (43-59%  depending  on  the  tumor  type, 
NSCLC  vs.  HNSCC).  Some  tumors  have  2  hits  (35%),  but  most  tumors  would  appear  to  have  a 
single  hit  (more  frequently  methylation  and  less  frequently  LOH),  suggesting  that  this  may  be 
another  case  of  a  haploinsufficient  tumor  suppressor  gene.  TCF21  is  an  important  basic  helix- 
loop-helix  transcription  factor  that  plays  an  important  role  in  the  epithelial-mesenchymal 
transition  (EMT),  and  haploinsufficiency  may  be  sufficient  to  be  tumorigenic.  Based  on  our 
methylation  profiling,  we  conclude  that  TCF21  promoter  hypermethylation  and  TCF21 
expression  are  good  biomarkers  of  both  early  lung  and  head  and  neck  cancer  (manuscript  in 
preparation). 

We  completed  the  evaluation  of  methylatable  TSG  [PTPRD  and  RARRES1 ].  According  to 
Knudson’s  “two-hit”  hypothesis,  second  hits  at  tumor  suppressor  genes  are  needed  for 
tumorigenesis.  In  our  expression  analysis  of  these  same  samples,  PTPRD  mRNA  was  seen  to 
be  reduced  approximately  2-fold  in  primary  tumors  when  compared  to  adjacent  normal  tissue 
(T=-2.1527).  In  order  to  determine  whether  the  reduced  expression  of  PTPRD  was  due  to 
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haploinsufficiency  or  hyper-methylation,  we  used  PyroMethA  to  investigate  methylation  at  both 
PTPRD  promoters.  The  maximum  percent  methylation  seen  in  any  primary  tumor  for  island  1 
was  27.8%  and  for  island  2  it  was  29.7%.  These  percentages  were  not  significantly  different 
from  maximum  levels  seen  in  the  matching  normal  samples  (28%  for  island  1  and  23%  for 
island  2).  The  largest  positive  change  seen  between  normal  and  tumor  was  22.7%  in  primary 
tumors  for  island  1  and  21.8%  for  island  2.  However,  the  mean  change  was  less  than  4%  and 
probably  within  our  range  of  experimental  error.  Overall,  it  does  not  appear  that  hyper- 
methylation  plays  an  important  role  in  PTPRD  loss  of  function.  The  observed  reduced 
expression  may  be  entirely  due  to  LOH,  and  PTPRD  may  therefore  be  a  haplo-insufficient  tumor 
suppressor. 

Since  RARRES1  had  previously  been  reported  to  be  hypermethylated  in  prostate  cancer  and 
squamous  cell  carcinoma  of  the  esophagus,  we  also  investigated  promoter  hypermethylation  at 
this  locus.  Approximately  24%  of  lung  cancer  and  10%  of  HNSCC  tumors  showed  significant 
hypermethylation,  as  did  a  number  of  cancer  cell  lines. 

We  also  performed  global  hypomethylation  of  tumors.  Loss  of  genome-wide  methylation  is  a 
common  feature  of  cancer,  and  the  degree  of  hypomethylation  has  been  correlated  with 
genomic  instability.  Global  methylation  of  repetitive  elements  possibly  arose  as  a  defense 
mechanism  against  parasitic  DNA  elements,  including  retrotransposons  and  viral  pathogens. 
Given  the  alterations  of  global  methylation  in  both  viral  infections  and  cancer,  we  examined 
genome-wide  methylation  levels  in  HNSCC,  in  which  a  specific  set  of  cancers  (oropharyngeal) 
has  been  causally  associated  with  human  papilloma  virus  (HPV).  LINE  hypomethylation  was 
more  pronounced  in  HPV-negative  than  in  HPV-positive  tumors.  Moreover,  genomic  instability, 
measured  by  genome-wide  LOH  and  SNP  analysis,  was  greater  in  HNSCC  samples  with  more 
pronounced  LINE  hypomethylation.  Global  hypomethylation  was  variable  in  HNSCC.  Its 
correlation  with  both  HPV  status  and  degree  of  LOH  as  a  surrogate  for  genomic  instability  may 
reflect  alternative  oncogenic  pathways  in  HPV-positive  versus  HPV-negative  tumors. 

Aim  4  To  determine  protein  signatures  of  treatments  of  erlotinib  and  other 
therapeutic  agents,  alone  or  in  combination,  in  NSCLC  and  identify  molecular 
predictors  of  response. 

Summary  of  Research  Findings 


During  the  last  project  period,  we  used 
adenocarcinoma  and  squamous  cell 
carcinomas  to  test  the  expression  levels 
when  treated  with  SAHA  (HDAC 
inhibitor)  and  5-Aza-citidine  (DNMT 
inhibitor).  Results  of  these  tests 
suggested  that  we  could  use  the  same 
tumor  models  to  develop  biomarkers 
able  to  predict  further  responses  to 
similar  treatment  regimens. 

In  our  current  study,  we  used  the 
adenocarcinoma  and  squamous  cell 
carcinoma  models  to  test  a  neurotrophic 
receptor  kinase  (Trk)  inhibitor  for  similar 
expression  profiles  noted  in  earlier 


Figure  1:  A,  microdissection  of  tumor  stroma  from  a  passage 
3  tumor  model  (dissection  area  is  outlined).  B,  PCR  (1,  equal 
amount  of  human  and  murine  DNA;  2,  DNA  from  stroma; 

3,  DNA  from  dissected  tumor  cells). 
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results.  Although  expression  of  Trk  can  be  detected  in  primary  tumors,  most  of  the  cancer  cell 
lines  lost  the  expression  profile  needed  for  validation  of  anti-Trk  agents.  We  screened  eight  of 
our  heterotransplant  lung  cancer  models  for  expression  of  Trks  (Trk-A,  Trk-B,  and  Trk-C)  and 
their  corresponding  growth  factors  (NGF,  BDNF,  and  NT3)  using  real-time  RT-PCR  (Table  1). 
We  detected  modest  expression  of  certain  Trks  and  their  ligands  in  the  tumors.  We  then 
conducted  in  vivo  testing  using  the  Trk  inhibitor  as  a  single  agent  to  determine  a  potential  anti¬ 
tumor  effect  in  the  heterotransplant  lung  cancer  models.  Five  different  tumor  models  were  used 
in  the  initial  testing,  using  only  one  animal  per  treatment  group  per  model  to  serve  for  screening 
purposes.  In  lOOmg/kg  dose,  the  tumor  inhibitory  effect  was  observed  in  3  of  the  5  models, 
particularly  in  one  of  the  adenocarcinoma  tumor  models  (model  7)  where  tumor  growth  was 
inhibited  by  78%  compared  to  the  vehicle-treated  control.  This  model  expressed  only  Trk-B  and 
its  ligand,  BDNF,  but  not  other  Trks  and  their  ligands.  We  then  conducted  second  stage 
experiment  by  focusing  on  the  two  models  (models  7  and  8)  showing  signs  of  tumor  inhibition. 
Five  animals  were  assigned  to  each  treatment  group  for  each  model.  Although  the  tumor  growth 
inhibition  was  minimal  for  model  8,  a  similar  inhibitory  effect  (71%  inhibition)  was  observed  in 
model  7  as  compared  with  vehicle  treated  controls. 


Model  No. 

Histology 

TrkA/NGF 
(2  power  -  ACT 
x  1E6) 

TrkB/BDNF 
(2  power  -  ACT 
x  1E6) 

TrkC/NT3 
(2  power  -  ACT  x 
1E6) 

Response 

To  Trk  inhibitor 

MDACC1 

Adeno 

0. 1/0.2 

1. 0/0.3 

0/0 

No  growth 
inhibitory  effect 

MDACC2 

Poorly  dif 

0/0 

0.2/0. 1 

0.4/3. 9 

No  growth 
inhibitory  effect 

MDACC3 

Squamous 

0/0 

0/0.2 

0/0.7 

MDACC4 

Squamous 

0/0 

2.4/0. 1 

0/1.0 

MDACC5 

Adeno 

1. 9/0.5 

0/7.8 

0/1.1 

MDACC6 

Adeno 

0/0 

0/0.1 

0/0.5 

Inhibitory 

MDACC7 

Adeno 

0/0 

0. 2/3.0 

0/0 

Inhibitory  (78%) 

MDACC8 

Adeno 

0.4/3. 0 

0/32.8 

0/0 

Inhibitory 

MDACC9 

Sqamous 

Not  tested 

Not  tested 

Not  tested 

Table  1:  Heterotransplant  lung  cancer  models 


In  previous  studies,  it  has  been  shown  that  early  passages  of  heterotransplant  tumors  contain 
infiltrating  inflammation  cells  that  disappear  in  later  passages  of  the  tumors.  This  observation  is 
reasonable  because  most  of  the  infiltrating  inflammation  cells  are  terminally  differentiated  and 
are  not  renewable.  However,  it  is  poorly  understood  whether  other  tumor  stroma  cells  that  may 
have  stem-cell  features  in  the  tumors  remain  of  human  origin  in  the  tumor  models.  The 
presence  of  tumor-stroma  may  help  evaluate  agents  that  target  the  tumor  microenvironment  as 
part  of  their  mechanisms  of  action.  Our  preliminary  data  from  three  passages  of  the  tumors 
suggests  the  presence  of  human-origin  tumor  stroma  (Fig.  1 ).  The  stroma  DNA  derived  from 
the  tumors  showed  predominantly  human  origin  whereas  DNA  from  the  tumor  cells  contained 
no  murine  component  (Fig.  1).  Although  additional  experiments  are  needed,  the  result  is 
encouraging. 
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Aim  5  To  determine  a  clinical  utility  of  the  molecular  predictors. 

Summary  of  Research  Findings 

This  aim  was  completed  and  summarized  in  previous  reports. 

Key  Research  Accomplishments 

•  Identified  shared  regions  of  LOH  and  allelic  imbalance  through  DNA  profiling  of  HNSCC  and 
NSCLC  on  matched  T/N  and  T  samples. 

•  Demonstrated  that  several  candidate  tumor  suppressor  genes  in  these  regions  are  subject 
to  epigenetic  silencing  by  gene-specific  hypermethylation  in  both  HNSCC  and  NSCLC. 

•  Confirmed  TCF21  promoter  hypermethylation  and  TCF21  expression  as  promising 
biomarkers  of  both  early  lung  and  head  and  neck  cancer. 

•  Observed  that  early  passages  of  heterotransplant  tumors  contain  infiltrating  inflammation 
cells  that  disappear  in  later  passages  of  the  tumors. 

Conclusions 


We  performed  RNA  and  DNA  profiling  on  available  samples  to  identify  genes  and  genomic 
regions  that  are  altered  in  NSCLC  and  interrogated  additional  candidate  methylatable  genes  as 
potential  tumor  suppressor  genes  in  NSCLC,  to  identify  their  potential  as  biomarkers.  These 
experiments  have  generated  a  list  of  molecular  candidate  biomarkers  for  further  investigation  in 
larger  sample  sets. 


Core  B:  Biostatistics  &  Data  Management  Core 

(Core  Director:  J.  Jack  Lee,  Ph.D.) 


The  Biostatistics  and  Data  Management  Core  has  continued  to  work  with  all  IMPACT  Projects  in 
their  research  efforts,  especially  in  the  area  of  biostatistical  support  in  clinical  trial  design, 
implementation,  and  analysis  of  experimental  results.  We  also  developed  statistical  methods  to 
enhance  the  design  and  analysis  pertinent  to  the  lung  cancer  research. 


Specific  Aims: 

1.  To  ensure  that  the  results  of  all  projects  are  based  on  well-designed  experiments  and 
are  appropriately  interpreted  by  providing  experimental  design;  sample  size  estimates; 
power  calculations;  and  integrated,  comprehensive  analysis  for  each  basic  science,  pre- 
clinical,  and  clinical  study. 

2.  To  develop  a  data  management  system  that  integrates  clinical,  pathological,  and  basic 
science  data  while  providing  data  integrity  through  process  tracking  and  quality  control. 

3.  To  provide  statistical  and  data  management  support  for  genomic  and  imaging  studies 
including  microarray,  proteomics,  and  molecular  targeted  imaging. 

4.  To  develop  and  adapt  innovative  statistical  methods  pertinent  to  biomarker-integrated 
translational  lung  cancer  studies. 

5.  To  produce  statistical  reports  for  all  projects. 

6.  To  collaborate  and  assist  all  project  investigators  with  the  publication  of  scientific  results. 
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Summary  of  Research  Findings 

Core  B  continues  to  provide  statistical  support  in  the  revision  and  conduct  of  the  clinical  trial 
proposed  in  Project  1  under  the  leadership  of  Dr.  Ritsuko  Komaki,  which  has  been  accruing  well 
in  recent  months.  In  addition,  we  continue  to  provide  statistical  support  for  the  clinical  trial 
(ZD6474  in  malignant  pleural  effusion)  proposed  in  the  Developmental  Research  Project  1 
under  the  direction  of  Drs.  Roy  Herbst  and  Carlos  Jimenez. 

We  worked  with  Dr.  Ignacio  Wistuba  on  the  statistical  analysis  of  EGFR/EGFR  ligand 
expression  in  paired  primary  lung  cancer  and  brain  metastasis  (Project  5).  The  biomarkers 
analyzed  include  EGF,  Amphiregulin,  TGF-alpha,  p-HER3,  HER3,  EGFR,  pEGFR,  and  ErbB-2. 

We  have  also  worked  on  the  study  with  GRP78,  IL1 1 R  and  EphA5  on  301  NSCLC  patients  with 
Dr.  Wistuba  and  Dr.  Masanori  Sato  (Project  3). 

We  performed  statistical  analysis  on  Dr.  Krahe’s  project  involving  TCF21  immunohistochemical 
expression  in  NSCLC  patients.  TCF21  is  novel  candidate  tumor  suppressor  gene,  so  the 
abnormality  is  loss  or  reduction  of  expression.  We  examined  the  immunohistochemical  (IHC) 
expression  of  the  protein  using  formalin-fixed  and  paraffin-embedded  tissues  from  308  NSCLC 
patients  (adenocarcinoma  and  squamous  cell  carcinoma  histologies)  placed  in  tissue 
microarrays  (TMA)  as  performed  by  the  Pathology  Core  (Dr.  Wistuba).  In  addition,  we 
collaborated  with  the  TCF21/EGFR  analyses  with  202  patients. 

We  also  worked  with  Dr.  Wistuba  on  the  statistical  analyses  on  immunohistochemical 
expression  of  the  multiple  biomarkers:  STAT1,  LCK,  DUSP6,  EGFR,  and  KRAS  with  306 
NSCLC  patients.  It  was  reported  that  a  five-gene  expression  signature  (DUSP6,  MMD,  STAT1, 
FIER3,  and  LCK)  was  an  independent  predictor  of  relapse-free  and  overall  survival  in  a  series  of 
185  NSCLCs.  Our  goal  was  to  study  the  protein  IHC  expression  of  4  of  those  genes  in  our  large 
series  of  NSCLC  TMAs  and  identify  if  the  IHC  expression  correlated  with  patients’ 
clinicopathologic  features  and  outcome. 

We  worked  with  Dr.  Wistuba  on  statistical  analysis  on  the  data  for  VEGFR  and  EGFR  pathway 
expression  in  lung  cancer  (284  observations),  as  well  as  an  analysis  of  IHC  expression  of  stem 
cell  markers  associated  to  NSCLC  clinico-pathologic  characteristics  with  Drs.  Yuan  and 
Wistuba. 

We  are  also  working  on  HER  family  analysis  including  EGFR,  Her2  and  Her3  and  their 
associated  ligands.  We  will  compare  the  expression  of  individual  markers  by  IHC  to  find  any 
link  between  individual  markers  and  clinical  (gender,  stage,  smoking  history,  etc)  and 
pathological  (total  NSCLC,  and  subtypes  adenocarcinoma  and  squamous  cell  carcinoma) 
parameters  and  survival  (recurrence-free  survival  and  overall  survival  at  5  years). 

Key  Research  Accomplishments 

•  Continued  to  provide  statistical  support  in  the  clinical  trials  for  Project  1  and  DRP-1 . 

•  Provided  statistical  support  for  Projects  2,  3,  4,  6,  and  Pathology  Core. 

•  Continued  to  work  closely  with  the  Project  4  PI  (Dr.  Reuben  Lotan)  on  synergy  studies  of 
combination  drug  treatments  in  cell  lines  to  determine  whether  the  effect  is  synergistic, 
additive,  or  antagonistic. 

•  Developed  a  flexible  semi-parametric  model  and  codes,  which  allows  fitting  very  general 
interaction  patterns  for  the  drug  combinations. 
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•  Developed  methods  and  provided  codes  to  construct  the  confidence  interval  for  the 
interaction  index  for  the  Emax  model. 

•  Developed  an  additive  hazards  model  with  time-varying  coefficients. 

•  Developed  a  new  Bayesian  cure  rate  model  to  estimate  the  cure  rate  and  threshold. 

•  Developed  a  cure  rate  model  with  covariate  measurement  errors. 

•  Developed  a  Bayesian  dose-finding  trial  design  with  multiple  drugs. 

Conclusions 

Core  B  continued  to  provide  statistical  analysis  and  data  management  support  for  all  research 
projects  in  the  IMPACT  study.  We  will  assist  the  other  Projects  with  their  ongoing  needs  to 
complete  the  proposed  studies. 


Core  C:  Pathology  Core 

(Director:  Ignacio  Wistuba,  M.D.) 

The  IMPACT  interdisciplinary  research  proposal  for  studying  targeted  therapy  of  lung  cancers 
has  required  extensive  histopathologic,  IHC,  and  molecular  studies  of  cell  and  tissues 
specimens,  which  have  been  assisted,  coordinated  or  performed  by  the  Pathology  Core.  One  of 
the  most  important  roles  of  the  Pathology  Core  has  been  to  provide  professional  technical 
services  for  proper  procurement,  storage  and  use  of  human  and  animal  tissues,  as  well  as 
technical  assistance  for  IHC  analysis.  In  addition,  the  Pathology  Core  has  provided  assistance 
for  collection  and  evaluation  of  tissue  specimens  in  IMPACT  clinical  trials  in  lung  cancer 
patients. 


Aim  1  Develop  and  maintain  repository  of  tissue,  cell  and  serum  specimens  from 
patients  with  lung  neoplasia,  as  requested  by  the  various  component  projects. 

Summary  of  Research  Findings 

During  the  current  project  period,  we  continued  the  collection  of  tissue  and  cell  specimens  for 
several  IMPACT  projects  with  a  large  emphasis  on  the  clinical  trials: 

Project  1.  For  the  IMPACT  clinical  trial  using  erlotonib  (TarcevaR),  chemotherapy,  and 
radiotherapy  in  advanced  non-small  cell  lung  cancer  (NSCLC)  patients  (Protocol  2005-1023; 
Principal  Investigator:  Dr.  R.  Komaki),  the  Pathology  Core  has  assisted  in  the  evaluation  of 
tumor  tissue  specimens  of  15  NSCLC  cases  for  biomarker  analysis.  Of  those,  8  cases  have 
been  considered  adequate  for  analysis  and  6  of  them  have  been  examined  for  EGFR  (exon  18- 
21)  and  KRAS  (codons  12,  13  and  61)  mutations,  EGFR  copy  number  by  fluorescent  in  situ 
hybridization  (FISH),  and  EGFR  protein  (total  and  phosphorylated  [p])  expression  by 
immnuhistochemistry.  No  EGFR  and  KRAS  mutations  were  detected  in  these  8  cases,  and  one 
case  showed  increased  EGFR  copy  number  (high  polysomy).  High  levels  of  EGFR  and  p-EGFR 
protein  expression  were  detected  in  two  cases. 

Project  2.  For  the  IMPACT  clinical  trial  using  ZD6474  in  NSCLC  patients  with  malignant  pleural 
effusion  (Protocol  2005-9029;  Principal  Investigator:  Dr.  C.  Jimenez),  we  have  collected, 
processed  and  banked  specimens  from  the  effusions  from  13  patients  at  baseline.  In  addition, 
we  have  collected  specimens  at  2,  3,  6  and  10  weeks  from  a  total  of  31  pleural  effusions.  From 
these  effusions,  we  have  processed  and  stored:  a)  frozen  cell  pellet;  b)  formalin-fixed  and 
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paraffin-embedded  (FFPE)  cell  pellets  blocks  and  histology  sections;  (c)  alcohol-fixed  and 
frozen  cell  smears;  e)  cell  pellets  frozen  in  RNA  later;  f)  cell  pellets  frozen  in  DMSO  12%. 

Project  6.  We  have  continued  collecting  fresh  NSCLC  tumor  tissue  specimens  that  have  been 
distributed  to  Dr.  Mao’s  laboratory  for  developing  mouse  tumor  heterotransplants.  To  date,  we 
have  provided  fresh  tissue  from  31  lung  cancers  (18  adenocarcinomas,  9  squamous  cell 
carcinomas  and  4  others  NSCLCs),  and  of  those,  10  cases  have  been  provided  since  the  last 
annual  report.  In  all  of  these  cases,  FFPE  human  tissue  specimens  have  been  collected, 
banked  and  examined  (see  Project  6  report).  In  addition,  our  collaboration  on  the  protein 
expression  and  methylation  analysis  of  TCF21  gene  in  nSCLC  (Project  6,  Dr.  Krahe)  was 
completed  last  year. 

Other  projects.  We  have  completed  our  work  on  several  other  key  projects  in  IMPACT.  Project 
2,  designed  to  analyze  HER  and  VEGF/R  and  HIF-1a  markers  in  human  NSCLC  tissue 
specimens,  was  completed  in  2007.  Project  3  was  completed  with  the  analysis  of  GRP78,  IL- 
1 1 R  and  Eph5A  markers  in  human  NSCLC  tissue  specimens.  Finally,  the  aim  designed  to 
analyze  bFGF,  receptors  FGFR1,  and  2  markers  (Project  4)  in  human  NSCLC  tissue  specimens 
was  completed  and  published  (see  attached  paper). 

Aim  2  Develop  innovative  tissue  and  cell  reagents  from  lung  cancer  patients  for  the 
investigation  and  validation  of  the  molecular  endpoints  relevant  to  each 
component  project. 

Summary  of  Research  Findings 

The  development  of  new  tissue  and  cell  reagents  from  lung  cancer  patients  was  focused  in  four 
different  methodologies: 

Tissue  and  cell  pellets  microarravs  (TMA) 

The  TMAs  prepared  in  previous  years  (primary  lung  tumors,  paired  primary  and  brain 
metastasis  NSCLC  tumors,  and  cell  line  pellets)  have  been  utilized  for  several  IHC  and  FISH 
analyses,  including  gene  copy  number  analysis  of  EGFR,  HER2  and  NKX2-1  ( TITF-1 ). 

Establishment  of  short-term  cultures  and  cancer  cell  lines  of  clinical  lung  tumor  specimens 
During  last  year,  our  repository  of  50  NSCLC  cell  lines  obtained  from  our  DoD  grants 
collaborators,  Drs.  John  Minna  and  Adi  Gazdar  (UT-Southwestern  Medical  Center,  Dallas,  TX), 
has  been  maintained.  In  addition,  we  have  prepared  cell  line  pellet  blocks  for  IHC  and  FISH 
analysis  form  47  of  these  cell  lines,  including  14  2-core  TMAs,  that  have  been  used  to  optimize 
new  antibodies  and  FISH  DNA  probes. 

Lung  cancer  heterotransplants  using  clinical  lung  tumor  samples 

As  stated  above,  10  new  specimens  have  been  provided  to  Dr.  Mao’s  lab  (Project  6)  to 
establish  these  models. 

Pleural  fluid  specimens 

In  collaboration  with  Dr.  C.  Jimenez  (Department  of  Pulmonary  Medicine),  we  have  established 
a  large  repository  of  lung  cancer  malignant  pleural  fluids  (Table  1). 
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Number 

Lung  Cancer 

Mesothelioma 

Other  Tumors 

Adenocarcinoma 

Squamous  Cell 
Carcinoma 

SCLC 

Cases 

80 

29 

8 

8 

96 

Samples 

Supernatant 

164 

58 

16 

16 

184 

Pellets  in  RNA  later 

79 

26 

8 

9 

85 

Cell  blocks 

96 

29 

8 

10 

90 

Cryopreserved 

Cells 

63 

20 

16 

22 

52 

Smears 

172 

62 

9 

30 

96 

Table  1.  Number  of  cases  and  sampl 

es  of  malignanl 

pleural 

1  fluid  collected  and  banked  by 

tumor  type. 

Aim  3  Process  human  and  animal  cell  and  tissues  for  histopathological, 
immunohistochemical  (IHC)  and  molecular  analyses,  including  tissue 
microdissection,  as  required  by  each  component  project. 

Summary  of  Research  Findings 

As  the  previous  years,  archival  and  prospectively  collected  frozen  and  FFPE  cell  and  tissue 
specimens  from  NSCLC  patients  have  been  processed  and  distributed  for  Projects  1,  2,  3,  4 
and  6. 

Aim  4  Perform  and  evaluate  IHC  analysis  in  human  and  animal  cell  and  tissue 
specimens,  as  required  by  the  various  component  projects. 

Summary  of  Research  Findings 

Most  of  the  studies  performed  in  collaboration  with  IMPACT  research  projects  have  been 
completed  as  reported  in  the  previous  year.  During  the  current  project  period,  we  have 
expanded  our  collaboration  with  Project  1  by  continuing  to  analyze  IHC  and  genetic  markers 
that  are  relevant  to  the  EGFR  and  VEGF/R  pathways  in  lung  cancer.  Three  main  projects  have 
been  performed  in  the  Pathology  Core  in  collaboration  with  this  project: 

Analysis  of  VEGFA  and  VEGFR-2  copy  number  and  mutation  in  NSCLC 
In  collaboration  with  Dr.  R.  Herbst  (Co-PI  Project  1 ),  we  previously  showed  that  VEGFR  and 
EGFR  pathways  are  positively  correlated  in  early  stage  NSCLC  and  IHC  expression  of  p-VEGF- 
R2  is  an  indicator  of  worse  overall  survival  in  stage  l-IIIA  NSCLC.  Recently,  we  have  performed 
VEGFA  and  VEGFR-2  gene  copy  number  analysis  by  quantitative  (q)-PCR  analysis  of  DNA 
extracted  from  50  NSCLC  cell  lines  and  101  frozen  tumor  specimens.  In  NSCLC  cell  lines, 
increased  copy  numbers  of  VEGF  and  VEGFR-2  has  been  detected  in  2%  and  5%, 
respectively.  Similar  frequencies  of  increased  copy  numbers  have  been  detected  in  tumor 
specimens  for  VEGFA  (3/10,  3%)  and  VEGFR-2  (5/101,  5%).  A  detailed  pathological  analysis  of 
tumor  specimens  having  these  gene  abnormalities  is  being  performed,  and  we  plan  to 
characterize,  among  other  markers,  the  microvascular  density  (MVD)  using  tissue  specimens. 

In  addition,  mutation  analysis  of  VEGFR-2  of  exons  7,  11,  21,  26  and  30  in  NSCLC  cell  lines 
and  tumor  specimens  is  underway.  We  have  detected  two  SNPs  (exons  1 1  and  21)  in  tumor  cell 
lines  and  only  one  potentially  activating  mutation  in  exon  21  (cgt946cat). 
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Analysis  of  the  IHC  expression  of  angiogenesis-related  markers  HIF-1a  and  carbonic  anhvdrase 
IX  (CA  IX)  in  tumor  specimens  from  330  NSCLC  patients  using  TMAs 

The  IHC  analysis  of  these  2  markers  has  been  completed,  and  the  preliminary  statistical 
analysis  demonstrated  that  NSCLC  tumors  expressed  relatively  high  levels  of  cytoplasmic  HIF- 
1a,  and  cytoplasmic  and  membrane  CAIX.  We  detected  low  levels  of  HIF-1  a  nuclear  expression 
in  tumors.  HIF-1  a  cytoplasmic  expression  did  not  correlate  with  tumor  histology  and  other 
clinicopathologic  features  except  smoking.  Cytoplasmic  HIF-1  a  was  significantly  (P<0.001) 
higher  in  smokers  compared  with  never-smokers. 

CAIX  membrane  expression  was  significantly  higher  in  squamous  cell  carcinomas  than 
adenocarcinomas  (P<0.001),  and  in  tumors  with  more  advanced  TNM  stage  (P=0.001). 
Interestingly,  patients  with  positive  membrane  CAIX  had  a  higher  risk  of  recurrence  (P=0.0096, 
HR  [95%C]  =  2.30  [1.22,  4.30]).  After  adjusting  for  lymph  node  (N)  stage,  the  membrane  CAIX 
effect  remains.  In  addition,  patients  with  positive  membrane  CAIX  had  higher  risk  of  recurrence 
(P=0.01,  HR  [95%CI]  =  2.23  [1.19,  4.19]).  In  multivariate  analysis,  and  aftter  adjusting  for 
histology,  age,  gender,  smoking  history,  N  stage  and  overall  TNM  stage,  patients  with  higher 
membrane  CAIX  had  higher  risk  of  death  (P<  0.05). 

Additional  IMPACT  research  activities  of  the  Pathology  Core 

The  Pathology  Core  has  performed  independent  research  activities  to  better  characterize  the 
EGFR  molecular  abnormalities  involved  in:  a)  the  progression  of  lung  cancer,  b)  its  correlation 
with  other  lung  cancer  molecular  abnormalities,  and  c)  to  establish  their  role  in  the  development 
of  lung  cancer  brain  metastasis.  During  the  last  year,  we  completed  and  submitted  some  of  the 
studies  for  publication.  As  a  result  of  this  additional  work  performed  by  the  Pathology  Core,  we 
published  one  paper  and  submitted  two  manuscripts  within  the  last  year.  We  also  have  data  for 
a  third  manuscript  that  is  currently  in  preparation.  In  addition,  four  abstracts  were  presented  at 
the  2008  AACR  Annual  Meeting  (San  Diego,  CA,  April  2008),  and  one  has  been  accepted  for 
presentation  in  the  2009  AACR  Annual  Meeting  (Denver,  CO,  April  2009). 

Completed  Projects: 

a)  EGFR  abnormalities  in  the  progression  of  lung  adenocarcinomas.  A  paper  by  X.  Tang  et 
al.  was  published  in  Cancer  Prevention  Research  (Cancer  Prev  Res  1:  192-200,  2008;  see 
attached  paper). 

b)  Association  of  EGFR  gene  abnormalities  with  estrogen  and  progesterone  receptors 
expression.  A  manuscript  authored  by  G.  Raso  et  al  has  been  submitted  to  Clinical  Cancer 
Research  (see  attached  manuscript).  This  manuscript  received  a  favorable  review  and  it  will  be 
resubmitted  with  the  revisions  in  March  2009. 

c)  Characterization  of  HER  family  receptors  markers  and  EGFR  gene  abnormalities  in 
NSCLC  primary  tumors  and  brain  metastasis.  A  manuscript  authored  by  M.  Sun  et  al  has 
been  submitted  to  Clinical  Cancer  Research  (see  attached  manuscript).  The  manuscript 
received  a  favorable  review  and  it  will  be  resubmitted  with  some  new  data  in  March  2009.  As 
requested  by  the  reviewers,  we  have  added  information  comparing  in  primary  tumors  and 
corresponding  brain  metastasis  the  protein  expression  by  IHC  of  5  HER  ligands,  including 
epiregulin,  betacellulin  (BTC),  heparin-binding  epidermal  growth  factor-like  growth  factor  (HB- 
EGF),  neuroregulin-1  (NRG1),  and  -2  (NRG2).  The  new  data  are  currently  under  statistical 
analysis  by  the  Biostatistical  Core  for  the  resubmission. 
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Ongoing  Projects: 

a)  Role  of  NKX2-1  (TITF-1)  gene  amplification  and  protein  overexpression  in  lung  cancer 
and  its  association  with  EGFR  abnormalities.  Following  our  data  indicating  that  NKX2-1 
amplification  by  FISH  correlated  with  outcome  in  patients  with  NSCLC,  including  both 
adenocarcinoma  and  squamous  cell  carcinoma  histology  (platform  presentation  in  the  2008 
AACR  Annual  Meeting,  San  Diego,  CA),  we  have  expanded  our  initial  observation  to  a  larger 
set  of  NSCLC  cases  and  established  a  DNA  q-PCR  methodology  for  analysis  of  NKX2-1  copy 
number  analysis.  This  work  has  been  accepted  for  platform  presentation  in  the  2009  AACR 
Annual  Meeting,  Denver,  CO  (see  attached  abstract).  A  manuscript  consolidating  all  these  data 
is  in  preparation  by  Ximing  Tang  et  al. 

b)  EGFR  and  HER2  copy  analysis  by  FISH  in  tumor  specimens  from  330  NSCLC  patients 
using  TMAs,  and  correlation  with  EGFR  and  KRAS  mutations  and  patients’ 
clinicopathologic  features.  During  the  last  year  we  have  completed  the  analysis  of  the  copy 
number  analysis  by  FISH  of  these  two  genes  in  a  large  series  of  NSCLC  tumors.  Gene  mutation 
and  patients’  clinicopathologic  data  are  available  and  the  statistical  analysis  by  the  Biostatistical 
Core  is  in  progress.  The  main  aim  of  this  study  is  to  address  whether  EGFR  and/or  HER2  gene 
amplification  identify  a  subset  of  NSCLC  patients  with  stages  l-IIIA  with  worse  outcome  and 
higher  incidence  of  tumor  metastasis. 

Key  Research  Accomplishments 

•  Developed  a  large  repository  of  lung  cancer  tissue,  cytology  and  cell  lines  specimens  with 
annotated  clinical  data,  to  be  utilized  for  research  projects. 

•  Developed  a  series  of  lung  cancer  heterotransplants  in  mice  in  collaboration  with  Project  6. 

•  Characterized  the  expression  (VEGF/VEGFR,  HIF-Ia/CAIX,  and  bFGF/receptors)  and 
molecular  abnormalities  ( VEGFA  and  VEGFR-2)  of  angiogenic  markers  in  NSCLC. 

•  Identified  the  sequence  of  EGFR  abnormalities  involved  in  the  progression  and  metastasis 
of  lung  adenocarcinomas. 

•  Established  the  correlation  between  EGFR  mutations  and  ER  expression  in  lung 
adenocarcinomas. 

•  Characterized  the  abnormalities  of  NKX2-1  ( TITF-1 )  in  NSCLC  and  its  association  with 
patients’  outcome. 

•  Presented  4  abstracts  at  the  2008  AACR  annual  meeting  with  an  additional  abstract  to 
presented  at  the  2009  annual  meeting. 

Conclusions 


The  Pathology  Core  has  assisted  and  collaborated  actively  with  several  research  projects  to 
perform  multiple  histopathological,  immunohistochemical,  and  genetic  studies  in  a  large  series 
of  lung  cancer  tissue  (archival  and  prospectively  collected  in  the  IMPACT  clinical  trials) 
specimens.  In  addition,  the  Pathology  Core  has  managed  to  conduct  specific  research  activities, 
which  fully  integrate  with  some  of  the  IMPACT  research  projects.  Due  to  the  direct  contribution 
of  the  Pathology  Core,  four  abstracts  have  been  presented  and  one  has  been  accepted  for 
presentation  at  international  meetings,  two  papers  have  been  published,  and  two  manuscripts 
are  in  resubmission  process.  The  Pathology  Core  has  successfully  fulfilled  the  goals  proposed 
for  the  fourth  year  of  the  IMPACT  project. 
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Core  D:  Imaging  Core:  Provide  Imaging  Support  for  IMPACT  Projects 

(PI  and  co-PI:  Juri  Gelovani,  M.D.,  Ph.D.;  Chun  Li,  Ph.D.) 


Summary  of  Research  Findings 

Project  2.  During  the  past  reporting  period,  the  Imaging  Core  D  completed  the  development 
and  evaluation  of  standard  operating  procedures  and  chemistry,  manufacturing,  and  control 
sections  of  an  IND  for  a  Phase  I  clinical  study  on  PET  imaging  with  18F-PEG6-IPQA  to  assess 
radiation  dosimetry  and  radiolabeled  metabolites  in  patients  with  NSCLC  (Fig.  1 ).  Specifically, 
the  Core  developed  computer-controlled 
automated  procedures  for  the  radiolabeling  of 
18F-PEG6-IPQA  and  QC/QA  procedures  required 
for  preparation  of  an  IND  application  to  the  FDA. 

The  Core  performed  a  series  of  PET/CT  imaging 
studies  in  non-human  primates  (rhesus 
macaques)  to  assess  the  pharmacokinetics, 
biodistribution,  metabolism,  and  radiation 
dosimetry  of  18F-PEG6-IPQA,  which  is  required 
for  preparation  of  an  IND  application  to  the  FDA. 

The  results  of  this  study  are  described  in  the 
report  for  Project  2. 

Also,  the  Imaging  Core  has  conducted  the  final 
series  of  18F-PEG6-IPQA  microPET  imaging 
studies  in  mice  bearing  different  NSCLC 
xenografts,  described  in  the  report  for  Project  2. 

Project  4.  The  Imaging  Core  continued  to  conduct  bioluminescence  imaging  (BLI)  studies  in 
mice  bearing  s.c.  xenografts  of  A549  carcinomas  expressing  GFP-Luc  (GL)  dual-modality 
reporter  gene  (previously  transfected  by  the  Imaging  core).  The  expression  of  GL  reporter  gene 
in  lung  carcinoma  xenografts  growing  in  mice  was  assessed  by  BLI  after  i.p.  injection  of  D- 
luciferin  (Biosynth,  Int.,  Naperville,  IL)  substrate  at  150  mg/kg  per  mouse  10  minutes  prior  to 
imaging  with  IVIS200  system  (Xenogen,  Almeda,  CA). 


Key  Research  Accomplishments 

•  Produced  novel  18F-PEG6-IPQA  derivative  for  PET  imaging  studies  in  small  animals  and 
non-human  primates  for  Project  2. 

•  Maintained  SOPs  and  performed  QC/QA  procedures  for  routine  synthesis  of  ^In-DTPA- 
PEG-AnnexinV  for  SPECT/CT  imaging  in  Project  I  and  performed  imaging  studies. 

•  Performed  microPET/CT  imaging  (and  autoradiography)  studies  in  mice  with  orthotopic 
models  of  different  NSCLC  using  18F-PEG6-IPQA  in  Project  2. 

•  Performed  routine  synthesis  of  [18F]FEAU  for  Project  3  and  conducted  microPET 
imaging  studies  for  this  project. 

•  Performed  synthesis  and  64Cu/111ln  radiolabeling  of  novel  cyclic  nanopeptides  for  animal 
imaging  studies  in  Project  3. 

•  Contracted  the  GLP  studies  on  toxicology  of  F-PEG6-IPQA  to  Charles  River 
Laboratories  (MA). 
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Conclusions 


The  Imaging  Core  continues  to  provide  highly  specialized  imaging  support  as  requested  by  the 
project  leaders  of  IMPACT  for  their  designated  research  projects. 


DRP-1:  Treatment  of  Malignant  Pleural  Effusion  with  ZD6474,  a  Novel  VEGFR  and  EGFR 
TK  Inhibitor 

(PI  and  co-PI:  Roy  Herbst,  M.D.,  Ph.D.,  Carlos  Jimenez,  M.D.) 


Aim  1  To  determine  clinical  effect  of  ZD6474. 

Aim  2  To  investigate  biological  correlates. 

Aim  3  To  investigate  radiographic  correlates. 

Aim  4  To  assess  quality  of  life. 

Summary  of  Research  Findings 

The  amended  single  arm,  open-label  study  to  evaluate  the  efficacy  of  ZD6474  on  the 
management  of  pleural  effusion  in  NSCLC  patients  was  activated  in  June  of  2007.  To  date,  17 
patients  have  been  enrolled  on  the  study. 

Six  patients  are  excluded  from  the  following  analysis: 

•  Five  patients  did  not  receive  medication: 

•  Non-compliance  (1) 

•  Benign  pleural  effusion  (2) 

•  Non-amenable  for  IPC  placement  (1 ) 

•  Renal  dysfunction  (1) 

•  One  patient  was  not  evaluable  due  to  placement  of  a  defective  intrapleural  catheter. 

Days  with  intrapleural  catheter  in  place 

Nine  patients  have  had  their  intrapleural  catheters  removed  at  days  13,  14,  14,  15,  22,  28  67,  68 
and  142.  One  patient  with  IPC  dislodged  on  day  15  and  was  removed  from  the  trial  on  day  23 
due  to  emphysema.  The  last  patient  still  has  the  intrapleural  catheter  in  place  after  126  days. 

Weeks  on  study  and  related  adverse  events 

Four  patients  stopped  medication  at  week  6  due  to  disease  progression.  One  patient’s 
symptoms  stopped  the  medication  at  week  6  after  developing  neurological  symptoms  and 
hyponatremia.  Another  patient  with  a  defective  intrapleural  catheter  was  excluded  from  the 
study  at  week  6.  One  patient  stopped  the  medication  at  week  7  due  to  recurrence  of  QTc 
prolongation  after  dose  reduction.  Four  patients  completed  10  weeks  of  treatment,  one  patient 
completed  18  weeks  of  treatment  and  currently  two  patients  are  on  medication  at  weeks  17  and 
32. 
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Response 

Six  patients  had  stable  disease,  one  patient  had  a  treatment  failure  due  to  emphysema  but  with 
stable  disease,  one  patient  had  a  partial  response  and  three  patients  displayed  disease 
progression. 

Correlative  studies 

In  collaboration  with  Dr.  Wistuba  (Core  C  Director),  we  have  been  collecting  pleural  effusion 
specimens  including  cryopreserved  cell  pellets,  supernatants,  FFPE  blocks,  smears,  and  RNA 
for  correlative  studies  (see  Table  2,  Core  C). 

Key  Research  Accomplishments 

•  Enrolled  1 7  out  of  25  patients  on  the  clinical  trial. 

•  Collected  specimens  for  correlative  analyses  on  all  patients  (4  pleural  fluid  samples  and  four 
blood  samples  per  patient). 

Conclusions 


The  trial  is  enrolling  patients;  all  analyses  will  be  performed  after  the  trial  is  completed. 


DRP-2:  TALK  -  Teens  and  Young  Adults  Acquiring  Lung  Cancer  Knowledge 

(PI:  Alexander  V.  Prokhorov,  M.  D.,  Ph.D.) 

Ninety  percent  of  lung  cancer  cases  in  adults  are  direct  results  of  smoking.  In  children  and 
young  adults,  tobacco  use  remains  a  major  public  health  problem  in  spite  of  the  recent  declines 
in  smoking  prevalence  among  children  and  adolescents.  Over  the  past  2-3  decades,  numerous 
factors  of  smoking  initiation  among  adolescents  have  been  thoroughly  investigated.  A 
considerable  volume  of  literature  is  currently  available  providing  important  clues  with  respect  to 
designing  tobacco  prevention  and  cessation  among  youth. 

Focusing  on  this  major  public  health  problem  -  tobacco  use  among  young  individuals  and  lack 
of  in-depth  knowledge  of  lung  cancer  issues  -  Project  TALK  (Teens  and  Young  Adults  Acquiring 
Lung  Cancer  Knowledge)  was  conceived  and  funded  as  a  smoking  cessation/prevention  pilot 
project  for  culturally  diverse  high-risk  young  populations  that  include  school  drop-outs, 
economically  disadvantaged,  and  underserved.  Using  modern  technologies,  the  Departments 
of  Behavioral  Science  and  Thoracic/Head  &  Neck  Medical  Oncology  have  joined  their  efforts  to 
conduct  this  developmental  project  under  the  leadership  of  Dr.  Alexander  V.  Prokhorov.  The 
project  will  assist  in  making  major  advances  in  lung  cancer  education  and  prevention  among 
youth.  Project  TALK  will  produce  a  CD-ROM-based  education/behavior  change  for  teenagers 
and  young  adults  (15-24  years  of  age). 

We  have  thus  been  devoting  our  effort  in  4  tasks  as  described  in  the  Statement  of  Work  based 
on  the  project  timeline: 

Taskl.  Develop  intervention  program.  Focus  groups  will  be  held  with  adolescents  and 
young  adults  to  ensure  we  are  capturing  the  essence  of  the  program,  using  the  right 
messages,  and  employing  the  appealing  video  and  animated  characters.  (Years  1-2) 

Task  2.  Develop  and  beta-test  CD-ROM.  This  includes  the  design  of  the  animation, 
illustrations,  scripts  and  accompanying  videos.  (Years  1-2) 
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Task  3.  Implement  program  in  agreed  upon  locations  and  recruit  young  adults  to 
participate  in  the  study.  (Years  3-4) 

Task  4.  Collect  and  analyze  data.  (Years  3-4) 


Summary  of  Research  Findings 

Participant  Recruitment 

A  total  of  239  high-risk  alternative  school  students  were  recruited  from  the  Houston  area.  Mean 
age  of  the  participants  was  16.2  years  (SD=1.0),  79%  were  male.  Thirty-six  percent  were 
Hispanic  and  49%  were  African-American.  Sixty-eight  percent  were  in  high  school  and  32%  in 
middle  school.  The  majority  (70%)  were  not  employed  and  30%  were  employed  part-time  or  full¬ 
time.  Approximately  25%  were  current  smokers  and  29%  former  smokers. 

The  recruitment  process  at  Fifth  Ward  Enrichment  Center  was  assisted  by  the  program’s 
Executive  Director;  a  school  counselor  at  M.R.  Wood  assisted  in  recruiting  the  participants  for 
the  study.  At  M.R.  Wood,  the  students  were  notified  of  the  project  by  the  school  counselor  and 
incoming  students  were  given  orientation  packages  that  included  the  informed  consent  to  be 
completed  and  turned  in  to  the  teaching  staff  or  counselor.  Other  students  were  recruited  by 
Project  TALK  staff  through  informal  interactions  with  students  who  had  heard  about  the  game 
and  were  interested  in  participating. 

Description  of  Educational  Videogame 

The  educational  videogame,  available  in  both  CD-ROM  and  kiosk  form,  uses  a  hospital 
metaphor  in  which  participants  “travel”  to  different  “rooms”  to  learn  about  the  dangers  of  tobacco 
use.  The  game  allows  users  to  personalize  the  experience  by  indicating  their  age,  gender, 
race/ethnicity,  smoking  status,  level  of  addiction  and  readiness  to  quit.  Depending  on  the  user's 
smoking  status,  the  user  enters  a  “nonsmoker”  or  “smoker”  educational  track.  Each  track  has  a 
different  set  of  hospital  rooms  to  explore,  from  which  users  pick  up  necessary  items  required  to 
“escape”  from  the  hospital  successfully.  The  various  rooms  educate  users  on  the  causes  of  lung 
cancer,  role  of  smoking  in  lung  cancer  development  and  tobacco  cessation/prevention. 

Testing 

The  on-site  program  testing  was  completed  in  May  2008. 

Survey 

The  evaluation  instruments  of  the  program  have  been  designed  and  consist  of  a  baseline 
survey,  7-day  follow-up  survey,  and  6-month  follow-up  survey.  Both  the  baseline  survey  and  7- 
day  follow-up  survey  were  used  at  the  target  sites.  The  6-month  follow-up  survey  was 
administered  over  the  phone.  Participants  are  then  given  the  option  to  leave  comments  or 
suggestions  with  regard  to  their  perception  of  the  program  and  how  it  can  be  improved. 

The  6-month  follow-up  survey  contains  all  the  same  questions  as  the  7-day  follow-up  survey, 
but  was  conducted  by  phone  rather  than  in  person. 

Seven  Day  Feasibility  Analysis 

The  primary  outcomes  for  the  7-day  feasibility  evaluations  included:  (1)  Ease  of  use  and 
enjoyment  of  various  features  of  the  game;  (2)  The  quality  of  the  interactive  multi-media 
presentations  and  the  environment  and  their  ability  to  generate  interest  in  smoking  and  lung 
cancer  prevention;  (3)  Motivational  appeal,  educational  value  and  behavior  modification  post- 


56 


Army  Award  W81XWH-05-2-0027;  Waun  Ki  Hong,  M.D. 

Annual  Report:  Reporting  Period  15  February  2008  -  14  February  2009 


completion  of  game;  and  (4)  Post-completion  knowledge  of  modules  in  the  game.  Comparisons 
of  these  outcomes  by  smoking  status  were  done  using  analysis  of  variance,  chi-square  tests 
and  nonparametric  methods  as  appropriate. 

Results 

The  7-day  feasibility  survey  was  completed  by  213  participants.  The  demographic  profile  of 
these  participants  was  similar  to  the  demographic  profile  of  the  239  participants  at  baseline. 
Mean  age  of  the  213  participants  was  16.1  years  (SD=1.0),  78%  were  male.  Thirty-six  percent 
were  Hispanic  and  51%  were  African-American.  Over  50%  reported  playing  the  game  two  or 
more  times.  The  top  three  highly  rated  rooms  were  surgery/operating  theater,  patient  records 
and  TV/theater  rooms.  Over  87%  of  participants  reported  ease  of  use  with  all  aspects  of  the 
game  and  the  majority  of  participants  enjoyed  most  features  in  the  game.  After  playing  the 
game,  94%  reported  that  the  materials  increased  their  knowledge  about  smoking  effects  and 
77%  reported  that  they  were  inspired  never  to  start  or  to  quit  and  70%  planned  to  share  the 
game  with  family  or  friends.  Post-completion  knowledge  of  modules  was  assessed.  The 
participants  were  tested  on  their  knowledge  about  specific  modules  and  the  majority  of 
participants  reported  that  smoking  and  secondhand  smoke  is  harmful  to  health,  harmful  to  the 
unborn  child  and  expensive.  However,  knowledge  regarding  nicotine  addiction  and  the  more 
severe  consequences  of  smoking  on  mortality  compared  to  other  addictions  was  low. 

Six-Month  Feasibility  Analysis 

The  6-month  survey  has  been  completed  by  146  participants.  All  participants  played  the  game 
at  least  once  and  40%  played  twice  or  more  times.  Over  85%  of  participants  reported  ease  of 
use  of  this  educational  tool  and  the  majority  of  participants  enjoyed  the  experience.  After  playing 
the  game,  94%  reported  increased  knowledge  about  the  tobacco  effects,  82%  were  inspired 
never  to  start  or  to  quit,  and  82%  planned  to  share  the  game  with  family  or  friends. 

Mediating  Variables  of  Smoking  at  6  month  Follow-up 

The  primary  outcomes  of  interest  in  this  study  were  the  mediating  determinants  of  smoking 
initiation  and  cessation  including  the  pros  and  cons  of  tobacco  use,  decisional  balance  and 
temptations  to  smoke.  These  outcomes  were  analyzed  using  mixed  model  regression  with  time 
and  baseline  smoking  status  and  their  interaction  as  fixed  effects.  There  was  a  significant 
interaction  effect  for  cons  of  smoking  (F  =  5.3;  p<.05),  decisional  balance  (F  =  8.0;  p<.01),  and 
temptations  to  smoke  (F  =  7.6;  p<.01).  For  baseline  smokers,  6-month  temptations  to  smoke 
were  significantly  lower  than  baseline,  and  6-month  cons  of  smoking  were  significantly  higher 
than  baseline.  For  nonsmokers,  these  variables  did  not  change  significantly. 

Smoking  Initiation  and  Cessation 

Overall  rates  of  self-reported  smoking  cessation  and  initiation  at  6-month  follow-up  were  not 
verified  by  cotinine  or  CO  validation.  Out  of  146  participants  who  responded  at  6  months,  34 
were  baseline  smokers  (23%)  and  112  were  non-smokers  (77%).  At  6  months,  18  (53%)  out  of 
34  baseline  smokers  reported  that  they  were  abstinent  and  7  (6%)  out  of  112  baseline  non- 
smokers  reported  that  they  had  initiated  smoking.  When  stratified  by  gender  and  ethnicity,  the 
results  showed  that  more  males  and  Hispanics  quit  smoking  compared  to  females  and  non- 
Hispanics,  although  the  results  were  not  statistically  significant  due  to  small  numbers.  Those 
who  quit  also  had  significantly  lower  scores  on  the  pros  of  smoking,  decisional  balance,  and 
temptations  to  smoke  compared  to  those  who  remained  smokers  (p<.05). 

Key  Research  Accomplishments 

•  Performed  testing  at  two  sites  with  a  total  of  239  participants. 
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•  Completed  7  day  follow-up  survey  with  a  total  of  213  participants. 

•  Completed  6  month  follow-up  survey  with  a  total  of  146  participants. 

Conclusions 


Project  TALK  was  successfully  developed  according  to  the  timeline.  It  produced  an  innovative, 
highly  informative,  and  easy  to  navigate  videogame  that  was  enthusiastically  accepted  by  young 
individuals  at  high  risk  of  initiating  tobacco  use.  More  than  half  the  smokers  who  participated 
had  quit  smoking  at  the  6-month  follow-up.  Overall,  results  have  shown  an  increased  level  of 
awareness  regarding  the  dangers  of  smoking  across  the  participants  enrolled.  We  aim  to  place 
our  informational  materials  in  other  strategic  areas  across  the  community  to  duplicate  results 
seen  in  this  project. 


Career  Developmental  Project  (CDP1):  Identification  of  Membrane  Proteins  in  Bronchial 
Epithelia  Cells  as  Biomarkers  of  Early  Detection  for  Lung  Cancer 

(PI:  Shin-Myung  Kang,  M.D.) 

Lung  cancer  is  the  leading  cause  of  cancer  deaths.  Early  detection  of  the  malignant  lesion  leads 
to  an  improved  5-year  survival  rate  after  surgical  resection.  Therefore,  advanced  screening  tools 
are  needed  urgently  to  detect  lung  cancer  at  an  early  stage  to  improve  control  of  such  deadly 
lung  cancer. 


Aim  1  To  isolate  membrane  proteins  uniquely  expressed  on  the  surface  of  squamous 
metaplasia  using  organotypically  cultured  bronchial  epithelial  cells. 

Summary  of  Research  Findings 

Membrane  proteins  were  isolated  from  in  vitro  models  of  squamous  metaplastic  bronchial 
epithelial  cells  and  compared  with  that  of  normal  mucocilliary  bronchial  epithelial  cells  by  2- 
dimensional  polyacrylamide  gel  electrophoresis.  Normal  human  tracheobronchial  epithelial 
(NHTBE)  cells  from  passage  3  originated  from  samples  collected  from  smokers  and  non- 
smokers;  samples  were  cultured  with  or  without  retinoic  acid  for  the  formation  of  in  vitro  model 
of  squamous  metaplasia  by  air-liquid  interface  method.  Cells  were  harvested  at  12  days  after 
air-liquid  formation.  Non-small-cell  lung  cancer  cell  lines  (H226,  H2170)  were  grown  on  plastic 
plate  in  RPMI  1640  containing  10%  fetal  bovine  serum.  H226  and  H2170  cells  were  harvested 
at  80%  confluence.  Membrane  proteins  were  extracted  using  ReadyPrep™  Protein  Extraction 
Kit  (Membrane  I)  (Bio-Rad)  according  to  the  manufacturer’s  instruction.  To  analyze  the 
differentially  expressed  proteins,  20  pg  of  membrane  protein  extracted  was  subjected  to  two- 
dimensional  PAGE  analysis.  The  samples  were  loaded  onto  7-cm  immobilized  pH  gradient 
strips  (pH  3-10,  nonlinear),  which  were  passively  rehydrated  for  15  hours.  Isoelectric  focusing 
was  conducted  using  a  Protein  isoelectric  focusing  cell  system  for  40,000V/h.  After  equilibration 
process,  the  strips  were  applied  to  10%  nongradient  SDS-PAGE  gel,  and  the  gels  were  stained 
using  Silver  Stain  Plus  kit  (Bio-Rad). 
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Figure  1:  2-D  PAGE  using  membrane  proteins  from  normal  human  tracheobronchial  epithelial  (NHTBE)  cells,  in 
vitro  model  of  squamous  metaplasia,  and  squamous  cell  carcinomas  of  lung  (H226,  H2170).  Isoelectric  point:  3-10 
(from  left  to  right) 

left  upper:  NHTBE  from  smoker 
left  lower:  NHTBE  from  non-smoker 
center  upper:  squamous  metaplasia  from  smoker 
center  lower:  squamous  metaplasia  from  non-smoker 
right  upper:  H226  (squamous  cell  carcinoma  of  lung) 
right  lower:  H2170  (squamous  cell  carcinoma  of  lung) 


NHTBE 

Metaplasia 

4  •**!— ^*  1 

-wiu:  •  f 

#  *  * 

Figure  2:  Highly  expressed  membrane  proteins  in  squamous  metaplasia 
(arrow)  compared  to  NHTBE  cells. 
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Metaplasia  from  non-smoker  Metaplasia  from  smoker 


Figure  3:  Highly  expressed  membrane  proteins  of  squamous  metaplasia  were  reproducibly  detected  in  other  sample 
sets. 


We  compared  the  expression  pattern  of  membrane  proteins  of  NHTBE  cells  and  squamous 
metaplasia  originating  from  smokers  or  non-smokers.  There  was  no  significant  difference  in 
membrane  protein  expression  on  2-DE  between  smokers  and  non-smokers  (Fig.  1).  We  then 
compared  the  expression  pattern  of  membrane  proteins  of  NHTBE  cells  to  those  of  the 
squamous  metaplasia  resulting  in  similar  pattern  displays.  However,  some  spots  had  higher 
expression  patterns  squamous  metaplasia  compared  to  NHTBE  cells  (Fig.  2).  Highly  expressed, 
membrane  proteins  of  squamous  metaplasia  were  reproducibly  detected  in  other  sample  sets 
(Fig.  3).  Last,  we  compared  expression  pattern  of  membrane  proteins  of  NHTBE  cells  to  those 
of  non-small-cell  lung  cancer  cell  lines  (H226,  H2170;  squamous  cell  carcinoma).  Membrane 
proteins  from  squamous  cell  carcinomas  displayed  very  different  pattern  of  expression 
compared  to  those  from  NHTBE  cells. 

Aim  2  To  identify  differentially  represented  proteins  using  proteomics. 

Summary  of  Research  Findings 

Using  HPLC-tandem  Mass  Spectrometer  in  collaboration  with  the  Proteomics  Core  facility  in  our 

institute,  we  sequenced  and 
determined  the  specific  identity  of 
the  proteins  isolated  in  Aim  1 .  After 
being  excised,  the  silver-stained 
spots  were  destained  and  digested 
in  gel  with  200  ng  of  modified  trypsin 
(Promega).  The  resulting  peptides 
were  analyzed  by  nano-liquid 
chromatography-tandem  mass 
spectrometry  with  on-line  desalting 
with  a  Famos  autosampler,  an 
Ultimate  nano-LC  module,  and  a 
Switchos  precolumn  switching 
device  (LC  Packings/Dionex). 


NHTBE  NSCLC;  Sq 


Figure  4:  Two  spots  selected  for  protein  identification  of  squamous  cell 
carcinoma  of  lung  H226  (arrow  &  arrow  head). 
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Electrospray  ion-trap  mass  spectrometry  was  done  using  an  LTQ  linear  ion-trap  mass 
spectrometer  (Thermo).  The  fragment  spectra  were  analyzed  using  the  National  Center  for 
Biotechnology  Information  nonredundant  protein  database  and  the  Mascot  search  engine 
(Matrix  Science). 

We  decided  to  compare  membrane  protein  expression  of  NHTBE  cells  to  that  of  squamous  cell 
carcinoma  (H226).  We  selected  two  spots  (Fig.  4)  which  were  expressed  exclusively  in 
squamous  cell  carcinoma  of  lung  on  2-DE  gel,  and  we  identified  these  spots  (Table  1  and  2).  As 
shown,  we  did  not  find  any  membrane  proteins  that  matched  the  molecular  weight  and 
isoelectric  point  to  the  spots.  Thus,  we  are  now  repeating  the  identification  process  by  mass 
spectrometry  using  the  same  membrane  proteins  trypsin-digested.  In  addition,  we  are  preparing 
another  set  of  membrane  proteins  from  the  NHTBE  cells,  squamous  metaplasia,  H226,  and 
H2170  to  verify  the  reproducibility  of  the  expression  pattern  on  2-DE  gel.  To  minimize  the 
keratin  contamination,  the  pre-cast  gradient  gels  will  be  used  in  following  experiments. 


Protein 

Accession  no. 

pi 

Mr  (kDa) 

Mascot  score 

Keratin  1 

AAG41947 

8.16 

66.1 

609 

Unnamed  protein  product 

CAA32649 

5.17 

59.5 

577 

Keratin,  type  II  cytoskeletal  2  epidermal  (cytokeratin- 
2e) 

P35908 

8.07 

65.9 

265 

Electron  transfer  falvoprotein,  alpha  polypeptide 
isoform  b 

NP_001 121 188 

8.75 

30.0 

91 

Keratin  9 

CAA52924 

5.14 

62.0 

81 

Table  1.  Identified  proteins  from  the  upper  spot  in  2-DE  gel  of  H226  (arrow) 


Protein 

Accession  no. 

Pi 

Mr  (kDa) 

Mascot  score 

Unnamed  protein  product 

CAA32649 

5.17 

59.5 

916 

Keratin,  type  II  cytoskeletal  2  epidermal 
(cytokeratin-2e) 

P35908 

8.07 

65.9 

911 

Keratin,  type  II  cytoskeletal  1 

P04264 

8.16 

66.0 

853 

Keratin  9 

CAA52924 

5.14 

62.0 

455 

Chain  A,  human  serum  albumin  in  a  complex  with 
myristic  acid  and  tri-iodobenzoic  acid 

1 BKE_A 

5.69 

66.0 

444 

Keratin  6 A 

NP  005545 
NP_490596 

8.09 

60.0 

441 

Protein 

Accession  no. 

Pi 

Mr  (kDa) 

Mascot  score 

Keratin  type  II 

AAC41769 

8.09 

60.2 

438 

Unnamed  protein  product 

BAG64893 

5.79 

58.9 

428 

Keratin  5 

AAH24292 

7.58 

62.4 

426 

Keratin 

AAB59562 

5.09 

51.7 

418 

Haptoglobin  Hp2 

1006264A 

6.23 

41.7 

391 

Keratin  17 

NP  000413 

4.97 

48.1 

251 

Complement  component  3,  isoform  CRA-a 

EAW69070 

8.24 

143.7 

224 

Keratin  1b 

CAD91892 

5.73 

61.8 

219 

Serpin  peptidase  inhibitor,  class  A  (alpha-1 
antiproteinase,  antitrypsin),  member  1 

AAH 15642 

5.37 

46.7 

188 

Alpha-fibrinogen  precursor 

AAA52426 

8.27 

69.8 

144 

Fibrinogen  gamma  chain 

AAB59531 

5.61 

49.5 

132 

Keratin  19 

BAA94608 

4.93 

11.8 

130 

Beta-fibrinogen  precursor 

AAA52429 

8.31 

54.9 

102 

Unnamed  protein  product 

BAC05019 

8.32 

51.6 

88 

Alpha-2-macroglobulin  precursor  (alpha-2-M) 

P01023 

6.00 

163.3 

85 

Transferrin 

NP  001054 

6.81 

77.0 

80 

Ig  kappa  chain  C  rgion  (allotype  lnv(1,2)) 

A37927 

5.60 

11.0 

70 

G  protein-coupled  receptor  kinase  interactor  1 
variant 

BAD92774 

6.63 

85.9 

45 

hCG2020995 

EAX01464 

6.87 

12.4 

45 

Table  2.  Identified  proteins  from  the  lower  spot  in  2-DE  gel  of  H226  (arrow  head) 
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Aim  3  To  verify  the  differentially  represented  proteins  using  PCR,  Western  blotting, 
and  immunocytochemistry. 

Summary  of  Research  Findings 

Work  on  this  Aim  is  dependant  upon  further  verification  of  the  results  gained  in  Aim  2  on  the 
differentially  expressed  membrane  proteins.  Confirmation  will  be  obtained  by  real-time  PCR, 
Western  blotting  analysis,  and  immunohistochemical  staining  of  the  putative  biomarkers  on  lung 
cancer  specimens  in  tissue  microarrays  in  collaboration  with  the  Pathology  Core. 

Key  Research  Accomplishments 

•  Demonstrated  that  membrane  proteins  from  squamous  cell  carcinomas  displayed  very 
different  pattern  of  expression  compared  to  those  from  NHTBE  cells. 

•  Sequenced  and  determined  the  specific  identity  of  the  proteins  isolated  from  NHTBE  and 
squamous  cell  carcinomas. 

Conclusions 


Comparisons  of  NHTBE  cells  and  squamous  metaplasias  from  smokers  and  non-smokers  have 
shown  that  squamous  metaplasias  display  a  unique  pattern  of  expression  compared  to  other 
cell  sets  in  our  project.  We  have  sequenced  and  identified  the  specific  identity  of  the  proteins 
isolated  in  Aim  1 .  Further  tests  were  run  with  different  cell  lines  to  duplicate  and  confirm  these 
results,  but  we  could  not  find  any  specific  membrane  points  that  were  identified  in  the  earlier 
studies.  Thus,  we  are  now  repeating  the  identification  process  by  mass  spectrometry  using  the 
same  membrane  proteins  trypsin-digested.  In  addition,  we  are  preparing  another  set  of 
membrane  proteins  from  the  NHTBE  cells,  squamous  metaplasia,  H226,  and  H2170  to  verify 
the  reproducibility  of  the  expression  pattern  on  2-DE  gel.  To  minimize  the  keratin  contamination, 
the  pre-cast  gradient  gels  will  be  used  in  following  experiments.  We  hope  that  future  results  will 
further  indicate  that  unique  membrane  proteins  can  be  used  as  biomarkers  of  early  detection  for 
lung  cancer. 
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KEY  RESEARCH  ACCOMPLISHMENTS 


Project  1:  Targeting  epidermal  growth  factor  receptor  signaling  to  enhance  response  of 
lung  cancer  to  therapeutic  radiation. 

•  Entered  22  patients  onto  the  erlotinib  (Tarceva)  plus  radiotherapy  for  locally  advanced 
NSCLC  trial  and  completed  evaluation  of  12  of  these  patients. 

•  Discovered  relationship  between  the  epithelial-to-mesenchymal  transition  and 
radiosensitivity  of  NSCLC  cells. 

•  Demonstrated  that  pretreatment  with  gefitinib  exerts  a  radioprotection  of  H1299-CDH1 
cells. 

•  Demonstrated  that  small  molecule  inhibitors  of  both  c-Met  and  IGF-1R  produce  a 
significant  radiosensitizing  effect  on  NSCLC  cells. 

•  Completed  an  assessment  of  the  combination  of  erlotinib  (Tarceva)  and  radiation  in  a 
NSCLC  xenograft  tumor  model. 

Project  2:  Molecular  imaging  of  EGFR  expression  and  activity  in  targeted  therapy  of  lung 
cancer 


•  Developed  the  fifth-generation  of  more  water-soluble  polyethylene  glycol  (PEG)-ylated  3- 
iodo-4-(phenylamino)quinazoline-6-acrylamide  (IPQA)  derivatives  labeled  with  18F  to 
simplify  the  translation  of  EGFR  kinase  imaging  agents  in  the  clinic  in  comparison  with 
124I-JGAP5. 

•  Discovered  [18F]fluoro-hexa-PEG-IPQA  has  less  hepato-biliary  clearance  and  a  longer 
plasma  circulation  half-time. 

•  Completed  the  evaluation  of  [18F]fluoro-hexa-PEG-IPQA  as  a  novel  PET  imaging  tracer 
in  six  rhesus  macaques. 

•  Initiated  manufacturing  the  non-radio  labeled  (cold)  fluoro-hexa-PEG-IPAQ  and  the 
precursor  with  Advanced  Biomedical  Compounds  (ABX).  The  precursor  (compound  9 
from  scheme  3)  will  be  used  to  manufacture  18F-fluoro  hexa  PEG-IPQA  clinical  dose  for 
phase  I  study. 

•  Finalized  the  toxicology  protocol  for  the  proposed  rodent  study  with  Charles  River  Labs. 

•  Improved  lab-scale  radiosynthesis  to  meet  cGMP  manufacturing  guidelines. 

•  Held  a  preliminary  meeting  with  oncologists  to  discuss  patient  recruitment  and  study 
objectives  for  a  future  clinical  trial. 

•  Presented  study  results  at  the  2008  World  Molecular  Imaging  Congress. 


Project  3:  Targeted  peptide-based  systemic  delivery  of  therapeutic  and  imaging  agents  to 
lung  tumors 

•  Tested  peptides,  selected  by  homing  to  blood  vessels  in  mouse  models,  as  carriers  to 
guide  the  delivery  of  cytotoxic  drugs,  pro-apoptotic  peptides,  metalloprotease  inhibitors, 
cytokines,  fluorophores,  and  genes. 

•  Identified  vascular  receptors  that  correspond  to  the  selected  peptides  found  in  blood 
vessels  of  normal  organs  and  tumors. 

•  Received  “safe  to  proceed”  status  by  the  FDA  on  our  BMTP-11  based  IND.  Human 
clinical  trials  will  be  activated  by  March  2009. 

•  Filed  a  second  IND  with  the  FDA  to  focus  on  BMTP78. 


63 


Army  Award  W81XWH-05-2-0027;  Waun  Ki  Hong,  M.D. 

Annual  Report:  Reporting  Period  15  February  2008  -  14  February  2009 


•  Generated  stable  and  biologically  active  networks  of  direct-assembled  Au-phage 
scaffolds,  in  which  we  can  tune  the  chemical  and  physical  properties  of  these  biological 
structures. 

•  Identified  vascular  markers  targeted  by  circulating  ligands  to  further  our  knowledge  of 
complex  cellular  and  molecular  diversity  of  the  human  vasculature. 

Project  4:  Inhibition  of  bFGF  Signaling  for  Lung  Cancer  Therapy 

•  Suppressed  the  growth  of  lung  tumor  cells  by  repressing  FGFR  activation  with  the  FGF 
receptor  inhibitor  MTA. 

•  Demonstrated  that  dominant  negative  FGF  receptor  adenoviral  expression  vector 
inhibits  the  growth  of  lung  cancer  cell  lines  by  inducing  apoptosis. 

•  Concluded  that  bFGF,  FGFR1,  and  FGFR2  are  highly  expressed  in  most  NSCLC 
tumors. 

Project  5:  Targeting  mTOR  and  Ras  signaling  pathways  for  lung  cancer  therapy 

•  Indicated  that  RAD001  increases  elF4E  phosphorylation  in  vivo  in  lung  xenografts. 

•  Demonstrated  that  mTOR  inhibition  induced  activations  of  the  MEK/ERK  signaling 
pathway  in  some  cancer  cell  lines  after  a  prolonged  treatment. 

•  Observed  co-targeting  of  mTOR  and  PI3K/Akt  signaling  exerts  enhanced  anticancer 
activity  in  lung  cancer  xenograft  models.  mTOR  inhibitors  increase  elF4E 
phosphorylation  in  lung  cancer  xenografts. 

•  Modulation  of  elF4E  phosphorylation  may  serve  as  a  predictive  biomarker  for 
RADOOI/erlotinib-based  cancer  therapy. 

•  Accrued  8  patients  on  the  clinical  trial. 

Project  6:  Identification  and  Evaluation  of  Molecular  Markers  in  Non-Small  Cell  Lung 

Cancer  (NSCLC) 

•  Identified  shared  regions  of  LOH  and  allelic  imbalance  through  DNA  profiling  of  HNSCC 
and  NSCLC  on  matched  T/N  and  T  samples. 

•  Demonstrated  that  several  candidate  tumor  suppressor  genes  in  these  regions  are 
subject  to  epigenetic  silencing  by  gene-specific  hypermethylation  in  both  HNSCC  and 
NSCLC. 

•  Confirmed  TCF21  promoter  hypermethylation  and  TCF21  expression  as  good 
biomarkers  of  both  early  lung  and  head  and  neck  cancer. 

•  Observed  that  early  passages  of  heterotransplant  tumors  contain  infiltrating  inflammation 
cells  that  disappear  in  later  passages  of  the  tumors. 

Core  B:  Biostatistics  &  Data  Management  Core 

•  Continued  to  provide  statistical  support  in  the  clinical  trials  for  Project  1  and  DRP-1 . 

•  Provided  statistical  support  for  Projects  2,  3,  4,  6,  and  Pathology  Core. 

•  Continued  to  work  closely  with  the  Project  4  PI  (Dr.  Reuben  Lotan)  on  synergy  studies  of 
combination  drug  treatment  in  cell  lines  to  determine  whether  the  effect  is  synergistic, 
additive,  or  antagonistic. 

•  Developed  a  flexible  semi-parametric  model  and  codes  which  allow  fitting  very  general 
interaction  patterns  for  the  drug  combinations. 
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•  Developed  methods  and  provide  codes  to  construct  the  confidence  interval  for  the 
interaction  index  for  the  Emax  model. 

•  Developed  an  additive  hazards  model  with  time-varying  coefficients. 

•  Developed  a  new  Bayesian  cure  rate  model  to  estimate  the  cure  rate  and  threshold. 

•  Developed  a  cure  rate  model  with  covariate  measurement  errors. 

•  Developed  a  Bayesian  dose-finding  trial  design  with  multiple  drugs. 

Core  C:  Pathology  Core 

•  Developed  a  large  repository  of  lung  cancer  tissue,  cytology  and  cell  lines  specimens 
with  annotated  clinical  data,  to  be  utilized  for  research  projects. 

•  Developed  a  series  of  lung  cancer  heterotransplants  in  mice  in  collaboration  with  Project 
6. 

•  Characterized  the  expression  (VEGF/VEGFR,  HIF-Ia/CAIX,  and  bFGF/receptors)  and 
molecular  abnormalities  ( VEGFA  and  VEGFR-2)  of  angiogenic  markers  in  NSCLC. 

•  Identified  the  sequence  of  EGFR  abnormalities  involved  in  the  progression  and 
metastasis  of  lung  adenocarcinomas. 

•  Established  the  correlation  between  EGFR  mutations  and  ER  expression  in  lung 
adenocarcinomas. 

•  Characterized  the  abnormalities  of  NKX2-1  ( TITF-1 )  in  NSCLC  and  its  association  with 
patients’  outcome. 

•  Presented  4  abstracts  at  the  2008  AACR  annual  meeting  with  an  additional  abstract  to 
presented  at  the  2009  annual  meeting. 

Core  D:  Imaging  Core 

•  Produced  novel  18F-PEG6-IPQA  derivative  for  PET  imaging  studies  in  small  animals  and 
non-human  primates  for  Project  2. 

•  Maintained  SOPs  and  performed  QC/QA  procedures  for  routine  synthesis  of  111ln-DTPA- 
PEG-AnnexinV  for  SPECT/CT  imaging  in  Project  I  and  performed  imaging  studies. 

•  Performed  microPET/CT  imaging  (and  autoradiography)  studies  in  mice  with  orthotopic 
models  of  different  NSCLC  using  18F-PEG6-IPQA  in  Project  2. 

•  Performed  routine  synthesis  of  [18F]FEAU  for  Project  3  and  conducted  microPET 
imaging  studies  for  this  project. 

•  Performed  synthesis  and  64Cu/111ln  radiolabeling  of  novel  cyclic  nanopeptides  for  animal 
imaging  studies  in  Project  3. 

•  Contracted  the  GLP  studies  on  toxicology  of  F-PEG6-IPQA  to  Charles  River 
Laboratories  (MA). 

DRP-1:  Treatment  of  Malignant  Pleural  Effusion  with  ZD6474,  a  Novel  VEGFR  and  EGFR 

TK  Inhibitor 

•  Enrolled  17  out  of  25  patients  on  the  clinical  trial. 

•  Collected  specimens  for  correlative  analyses  on  all  patients  (4  pleural  fluid  samples  and 
4  blood  samples  per  patient). 

DRP-2:  TALK  -  Teens  and  Young  Adults  Acquiring  Lung  Cancer  Knowledge 

•  Performed  testing  at  two  sites  with  a  total  of  239  participants. 

•  Completed  7  day  follow-up  survey  with  a  total  of  213  participants. 
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•  Completed  6  month  follow-up  survey  with  a  total  of  146  participants. 

CDP1:  Identification  of  Membrane  Proteins  in  Bronchial  Epithelia  Cells  as  Biomarkers  of 
Early  Detection  for  Lung  Cancer 

•  Demonstrated  that  membrane  proteins  from  squamous  cell  carcinomas  displayed  very 
different  pattern  of  expression  compared  to  those  from  NHTBE  cells. 

•  Sequenced  and  determined  the  specific  identity  of  the  proteins  isolated  from  NHTBE  and 
squamous  cell  carcinomas. 
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CONCLUSIONS 

Project  1:  We  conclude  that  the  epithelial-to-mesenchymal  transition  (EMT)  plays  a  significant 
role  in  governing  not  just  the  intrinsic  radiosensitivity  of  NSCLC  cells,  but  also  their  sensitivity  to 
inhibitors  of  the  epidermal  growth  factor  receptor  (EGFR)  and  the  ability  of  such  inhibitors  to 
radiosensitize  these  cells.  It  would  be  useful  to  assess  the  EMT  status  of  patients  treated  with 
these  combinations.  In  spite  of  this  finding,  results  suggest  that  such  combinations  might  be 
useful  in  the  clinic.  In  addition,  we  conclude  that  targeting  other  growth  factor  receptors  such  as 
the  c-Met  and  IG-F1R  receptors  may  be  an  alternative  strategy  to  using  EGFR  inhibitors. 

Project  2:  The  development  of  [18F]fluoro-hexa-PEG-IPQA  into  a  clinical  trial  has  been  actively 
pursued  with  clearly  defined  milestones  and  timelines.  Studies  will  focus  on  determining  the 
optimum  dosimetry  of  [18F]fluoro-hexa-PEG-IPQA  injection  based  on  critical  organ  safety  and 
detection  sensitivity.  Data  will  be  obtained  on  agent  distribution,  pharmacokientics,  radiation 
dosimetry,  and  metabolites  to  build  upon  our  current  knowledge  base  and  to  shape  future 
studies.  We  have  initiated  all  fronts  of  the  IND  development  work  to  expedite  the  translational 
process. 

Project  3:  We  propose  that  the  EphA5  receptor  expressed  on  the  surface  of  lung  cancer  cells 
controls  cells  proliferation  and  survival  by  activating  the  MAP  kinase  signaling.  We  showed  that 
our  EphA5-targeted  peptides  inhibit  phosphorylation  of  ERK1/2  and  JNK.  These  effects  would 
potentially  result  in  inhibition  of  lung  cancer  cell  proliferation  and  induction  of  apoptosis.  Future 
studies  will  allow  us  to  use  shRNA  to  inhibit  the  expression  of  EphA5  in  lung  cancer  cells  and  to 
analyze  cell  proliferation  and  survival  in  vitro  and  in  vivo.  We  will  over-express  EphA5  receptor 
in  normal  cells  and  analyze  whether  the  expression  of  the  receptor  will  cause  cell  transformation 
in  vitro  and  in  vivo.  We  will  also  treat  lung  cancer  tumor-bearing  mice  with  both  systemic 
injection  and  nasal  inhalation  of  peptide.  These  will  follow  optimization  studies  for  peptide 
stability,  the  use  of  liposome  as  carriers,  and  a  dose-escalation  experiment. 

Project  4:  During  the  period  of  performance,  we  have  shown  that  MTA,  as  an  effective  FGFR 
inhibitor,  is  a  potent  cell  growth  suppressor  and  apoptosis  inducer,  and  should  be  further 
evaluated  in  preclinical  setting  for  anti  tumor  effects.  Our  research  data  demonstrated  that  MTA 
inhibited  the  growth  of  the  lung  tumorigenic  cell  line  1170-1  in  a  dose-dependent  fashion  with 
50%  inhibition  that  was  achieved  in  our  lab.  These  findings  led  us  to  study  adenoviral  vectors 
harboring  dominant  negative  FGFR1  as  potential  candidates  for  preclinical  in  vivo  studies  with 
aerosolized  viral  formulation.  Studies  on  the  mechanism  by  which  DNFGFR1  acts  to  inhibit  the 
growth  and  induce  apoptosis  in  lung  tumor  cells  are  ongoing.  Our  preliminary  results  indicate 
that  DNFGFR1  increases  the  cell  cycle  inhibitors  p21  and  p27  and  decreases  the  cell  cycle 
regulators  cdc25  and  Chkl .  We  look  to  complete  these  studies  during  the  next  project  period  to 
confirm  that  the  bFGF  signaling  pathway  activation  may  be  an  early  indicator  of  SCC 
pathogenesis  and  a  novel  target  for  lung  cancer  chemopreventive  and  therapeutic  strategies. 

Project  5:  Targeting  the  mTOR  axis  appears  to  be  a  promising  strategy  against  lung  cancer. 
Given  the  nature  of  the  complexity  of  lung  cancer  signaling  pathways,  including  mTOR 
signaling,  it  is  essential  to  understand  the  biology  of  lung  cancer  and  the  mechanism  of  action 
for  the  therapeutics  of  interest  in  order  to  efficiently  treat  lung  cancer  through  application  of 
mechanism-driven  therapeutic  regimens.  Thus,  we  have  demonstrated  the  scientific  rationale 
for  our  effort  in  pursuing  mTOR-targeted  lung  cancer  therapy. 

Project  6:  We  performed  RNA  and  DNA  profiling  on  available  samples  to  identify  genes  and 
genomic  regions  that  are  altered  in  NSCLC  and  interrogated  additional  candidate  methylatable 
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genes  as  potential  tumor  suppressor  genes  in  NSCLC,  to  identify  their  potential  as  biomarkers. 
These  experiments  have  generated  a  list  of  molecular  candidate  biomarkers  for  further 
investigation  in  larger  sample  sets. 

Biostatistics  Core:  Core  B  continued  to  provide  statistical  analysis  and  data  management 
support  for  all  research  projects  in  the  IMPACT  study.  We  will  assist  the  other  Projects  with 
their  ongoing  needs  to  complete  the  proposed  studies. 

Pathology  Core:  The  Pathology  Core  has  assisted  and  collaborated  actively  with  several 
research  projects  to  perform  multiple  histopathological,  immunohistochemical,  and  genetic 
studies  in  a  large  series  of  lung  cancer  tissue  (archival  and  prospectively  collected  in  the 
IMPACT  clinical  trials)  specimens.  In  addition,  the  Pathology  Core  has  managed  to  conduct 
specific  research  activities,  which  fully  integrate  with  some  of  the  IMPACT  research  projects. 
Due  to  the  direct  contribution  of  the  Pathology  Core,  four  abstracts  have  been  presented  and 
one  has  been  accepted  for  presentation  at  international  meetings,  two  papers  have  been 
published,  and  two  manuscripts  are  in  resubmission  process.  The  Pathology  Core  has 
successfully  fulfilled  the  goals  proposed  for  the  third  year  of  IMPACT  project. 

Imaging  Core:  The  Imaging  Core  continues  to  provide  highly  specialized  imaging  support  as 
requested  by  the  project  leaders  of  IMPACT  for  their  designated  research  projects. 

DRP-1:  The  trial  is  enrolling  patients;  all  analyses  will  be  performed  after  the  trial  is  completed. 

DRP-2:  Project  TALK  was  successfully  developed  according  to  the  timeline.  It  produced  an 
innovative,  highly  informative,  and  easy  to  navigate  videogame  that  was  enthusiastically 
accepted  by  young  individuals  at  high  risk  of  initiating  tobacco  use.  More  than  half  the  smokers 
who  participated  had  quit  smoking  at  the  6-month  follow-up.  Overall,  results  have  shown  an 
increased  level  of  awareness  regarding  the  dangers  of  smoking  across  the  participants  enrolled. 
We  aim  to  place  our  informational  materials  in  other  strategic  areas  across  the  community  to 
duplicate  results  seen  in  this  project. 

CDP1:  Comparisons  of  NHTBE  cells  and  squamous  metaplasias  from  smokers  and  non- 
smokers  have  shown  that  squamous  metaplasias  display  a  unique  pattern  of  expression 
compared  to  other  cell  sets  in  our  project.  We  have  sequenced  and  identified  the  specific 
identity  of  the  proteins  isolated  in  Aim  1.  Further  tests  were  run  with  different  cell  lines  to 
duplicate  and  confirm  these  results,  but  we  could  not  find  any  specific  membrane  points  that 
were  identified  in  the  earlier  studies.  Thus,  we  are  now  repeating  the  identification  process  by 
mass  spectrometry  using  the  same  membrane  proteins  trypsin-digested.  In  addition,  we  are 
preparing  another  set  of  membrane  proteins  from  the  NHTBE  cells,  squamous  metaplasia, 
H226,  and  H2170  to  verify  the  reproducibility  of  the  expression  pattern  on  2-DE  gel.  To  minimize 
the  keratin  contamination,  the  pre-cast  gradient  gels  will  be  used  in  following  experiments.  We 
hope  that  future  results  will  further  indicate  that  unique  membrane  proteins  can  be  used  as 
biomarkers  of  early  detection  for  lung  cancer. 
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Confidence  Intervals  of  Interaction  Index 
for  Assessing  Multiple  Drug  Interaction 


J.  Jack  Lee  and  Maiying  Kong 


Studies  of  interactions  among  biologically  active 
agents  have  become  increasingly  important  in  many 
branches  of  biomedical  research.  We  consider  that  the 
Loewe  additivity  model  is  one  of  the  best  general  ref¬ 
erence  models  for  defining  drug  interactions.  Based  on 
the  Loewe  additivity  model,  synergy  occurs  when  the 
interaction  index  is  less  than  one,  and  antagonism  oc¬ 
curs  when  interaction  index  is  greater  than  one.  Starting 
from  the  Loewe  additivity  model  and  the  marginal  dose- 
effect  curve  for  each  drug  involved  in  a  combination,  we 
first  present  a  procedure  to  estimate  the  interaction  in¬ 
dex  and  its  associated  confidence  interval  at  a  combina¬ 
tion  dose  with  observed  effects.  Following  Chou  and  Ta- 
lalay’s  method  for  assessing  drug  interaction  based  on 
the  plot  of  interaction  indices  versus  effects  for  combi¬ 
nation  doses  at  a  fixed  ray,  we  then  construct  a  pointwise 
(1  —  a)  x  100%  confidence  bound  for  the  curve  of  interac¬ 
tion  indices  versus  effects.  We  found  that  these  methods 
work  better  on  the  logarithm  transformed  scale  than  on 
the  untransformed  scale  of  the  interaction  index.  We  pro¬ 
vide  simulations  and  case  studies  to  illustrate  the  perfor¬ 
mances  of  these  two  procedures,  and  present  their  pros 
and  cons.  We  also  provide  S-Plus/R  code  to  facilitate  the 
implementation  of  these  two  procedures. 

Key  Words:  Antagonism;  Loewe  additivity  model;  Synergy. 


1.  Introduction 

Studies  of  interactions  among  biologically  active 
agents,  such  as  drugs,  carcinogens,  or  environmental  pol¬ 
lutants,  have  become  increasingly  important  in  many 
branches  of  biomedical  research.  Our  research  group  re¬ 


viewed  the  literature  (Lee,  Kong,  Ayers,  and  Lotan  2007) 
and  agree  with  many  researchers  (e.g.,  Berenbaum  1985, 
1989;  Greco,  Bravo,  and  Parsons  1995)  that  the  Loewe 
additivity  model  should  be  considered  as  the  “gold  stan¬ 
dard”  for  defining  drug  interactions 

For  a  combination  of  k  drugs  ( k  >  2)  at  (d\, . . . ,  <4), 
based  on  the  Loewe  additivity  model,  drug  interactions 
at  this  combination  can  be  characterized  as 


<  1, 

synergy; 

=  1, 

additivity; 

(i) 

.>  1, 

antagonism. 

Here  d\, ...  ,dk  are  doses  of  each  drug  in  the  mix¬ 
ture  of  the  k  drugs  resulting  in  effect  y,  and  Dy\,  . . 
Dy  k  are  the  doses  of  drugs  that  result  in  the  effect  y 
for  each  respective  drug  given  alone.  The  summation, 
tt1 — F  •  •  •  +  ,  is  called  the  interaction  index,  which 

is  denoted  as  z.  Based  on  the  Loewe  additivity  model, 
the  combination  dose  (d\, ... ,  <4)  is  said  to  be  syner¬ 
gistic  if  the  interaction  index  is  less  than  the  constant 
number  of  1,  and  additive  or  antagonistic  if  the  index  is 
equal  to  or  greater  than  1,  respectively.  To  give  an  in¬ 
tuitive  idea  about  the  interaction  index,  we  illustrate  its 
meaning  in  the  special  case  of  k  =  2.  Note  that  the  com¬ 
bination  dose  ( d\ ,  di)  produces  the  same  effect  y  as  drug 
1  alone  at  dose  level  Dy\,  and  drug  2  alone  at  dose  level 
Dy  2,  which  implies  that  1  unit  of  drug  2  will  produce  the 
same  effect  as  units  of  drug  1.  Thus,  the  amount  of 

dose  at  the  combination  ( d\ ,  di)  equals  to  d\  +  di 
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in  terms  of  drug  1  dose.  By  definition,  r  =  +  jyf, 

which  implies  that  d\  +  r  <  1  implies 

d\  +  =  r/)y, i  <  Dy  i,  therefore,  the  amount  of 

the  combination  dose  ( d\ ,  df)  to  produce  the  same  effect 
y  is  less  than  the  amount  of  dose  when  single  drug  is 
applied,  hence,  indicating  synergy.  The  smaller  r  is,  the 
more  is  the  reduction  of  the  amount  of  dose  in  the  combi¬ 
nation,  and  the  stronger  is  the  synergy.  Similarly,  r  >  1 
implies  that  d\  +  <^27^  =  T^y,l  >  Dy,  1,  that  is,  the 
amount  of  the  combination  dose  ( d\ ,  J2)  producing  the 
same  effect  y  is  more  than  each  single  drug  dose,  hence 
indicating  antagonism.  The  geometric  interpretation  of 
the  interaction  index  can  be  best  shown  graphically  in 
Figure  1,  panels  (A)  and  (B),  where  P  =  (Z>y?i,0), 
Q  =  (0,  Dy  2 ),  U  =  (d\,  J2),  all  yield  the  same  effect  y. 
If  we  draw  a  line  RS  passing  through  U  and  parallel  to 
the  line  PQ ,  and  draw  a  line  OU  intercepting  with  P  Q 
at  V ,  then,  from  the  basic  geometric  properties,  the  inter¬ 
action  index  can  be  expressed  as  the  ratio  of  .  In 

Figure  1(A),  the  closer  the  point  U  is  toward  the  origin, 
the  less  is  the  amount  of  combination  dose  required  to 
produce  the  same  effect  as  drug  1  alone  at  Dy  \  or  drug 
2  alone  at  Dy2 ,  hence,  the  stronger  is  the  synergy.  By 
the  same  token  in  Figure  1(B),  the  further  the  point  U  is 
away  from  the  origin,  the  larger  is  the  amount  of  combi¬ 
nation  dose  required  to  produce  the  same  effect  as  drug  1 
alone  at  Dy\  or  drug  2  alone  at  Dy  2,  hence,  the  stronger 
is  the  antagonism.  From  Figure  1,  we  conclude  that  the 
interaction  index  can  be  used  to  measure  the  mode  and 
magnitude  of  drug  interactions. 

Given  the  combination  dose  (d\, . . . ,  df)  and  its  ef¬ 
fect  y,  and  the  marginal  dose-effect  curves  fi(Di)  for 
drug  i  (i  =  1,  ...,&),  the  calculation  of  the  interaction 
index  at  a  combination  dose  (d\, ... ,  djf)  is  straightfor¬ 
ward.  One  simply  replaces  Dy  j  by  /-_1(y),  where  f~l  is 
the  inverse  function  of  f)  (i  =  1, ...  ,k).  However,  since 
the  dose-effect  curves  are  usually  estimated  and  the  ef¬ 
fect  y  is  observed  with  error,  to  make  valid  inferences 
for  drug  interactions,  one  needs  to  account  for  all  these 
variabilities.  In  other  words,  one  needs  to  consider  the  es¬ 
timated  interaction  indices  along  with  their  variances  to 
make  valid  statistical  inferences  on  drug  interaction. 

In  most  settings,  the  functional  form  of  the  marginal 
dose  response  curves  may  not  be  known  and  need  to  be 
estimated  from  the  data.  Chou  and  Talalay  (1984)  pro¬ 
posed  the  median-effect  equation  which  has  been  widely 
used  to  model  the  dose-effect  curve  with  good  success 
(Chou  2006).  In  our  cell  line  study  (Kong  and  Lee  2006; 
Lee  et  al.  2007),  we  also  find  that  the  median-effect  equa¬ 


tion  fits  the  data  well.  Chou  and  Talalay ’s  median-effect 
equation  has  the  following  form 


where  d  is  the  dose  of  a  drug  eliciting  effect  E ,  Dm  is  the 
median  effective  dose  of  a  drug,  and  m  is  a  slope  parame¬ 
ter  depicting  the  shape  of  the  curve.  When  m  is  negative, 
the  curve  described  by  Equation  (2)  falls  with  increasing 
drug  concentration;  when  m  is  positive,  the  curve  rises 
with  increasing  drug  concentration.  The  median-effect 
Equation  (2)  is  independent  of  the  drug’s  mechanisms  of 
action  and  does  not  require  knowledge  of  conventional 
kinetic  constants  (Chou  2006;  Greco  et  al.  1995).  Under 
the  assumption  that  the  dose-effect  curves  follow  Chou 
and  Talalay ’s  median-effect  equation,  in  Section  3  we  in¬ 
vestigate  the  characteristics  of  the  interaction  index  and 
its  logarithmic  transformation,  and  propose  a  procedure 
to  construct  the  confidence  interval  for  the  estimated  in¬ 
teraction  index. 

Although  interaction  index  can  be  estimated  at  each 
combination  dose  separately,  this  approach  is  not  effi¬ 
cient.  The  result  tends  to  be  more  varying  as  it  depends 
on  only  measurement  at  a  single  combination  dose  level. 
To  gain  efficiency,  one  can  assume  a  model  and  pool  data 
at  various  combination  doses  to  form  a  better  estimate 
of  the  interaction  index.  One  commonly  used  approach 
is  applying  the  ray  design.  Chou  and  Talalay  (1984)  and 
Chou  (1991)  proposed  a  procedure  to  characterize  a 
two-drug  interaction  by  first  fitting  marginal  dose-effect 
curves  and  a  dose-effect  curve  for  the  combination 
doses  with  their  components  at  a  fixed  ray  (i.e.,  d\/d2=a 
constant,  forming  a  ray  in  the  d\  x  J2  dose  plane),  then 
assessing  drug  interaction  based  on  the  plot  of  their  com¬ 
bination  indices  versus  effects  for  combination  doses  at 
this  fixed  ray.  The  confidence  intervals  for  the  combina¬ 
tion  indices  were  constructed  by  Monte  Carlo  techniques 
(CalcuSyn  at  http://www.biosoft.eom/w/calcusyn.htm ; 
Belen’kii  and  Schinazi  1994;  CompuSyn  at 
http://www.combosyn.com/).  The  combination  index 
has  the  same  form  as  the  interaction  index  when  the 
combined  drugs  are  mutually  exclusive.  However,  Chou 
and  Talalay ’s  mutual  exclusiveness  and  nonexclusiveness 
criteria  are  difficult  to  evaluate,  and  the  combination  in¬ 
dex  has  been  criticized  by  many  researchers  (Berenbaum 
1989;  Greco  et  al.,  1995).  In  Section  3,  by  adopting  the 
interaction  index  for  Chou  and  Talalay ’s  method,  we 
extend  their  method  to  assess  drug  interactions  among 
k(>  2)  drugs,  and  propose  a  procedure  to  construct  a 
pointwise  (1  —  a)  x  100%  confidence  bound  for  the 
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A.  Synergy  B.  Antagonism 


Figure  L  Illustration  of  the  interaction  index.  P  =  (Dy  0)  represents  the  drug  1  dose  producing  effect  y,  Q  =  (0,  Dy  2)  represents  the  drug  2 
dose  producing  effect  y,  and  U  =  (d\,d2)  represents  the  combination  dose  producing  the  same  effect  y.  RS  is  the  line  passing  by  U  and  parallel 
to  PQ,  and  V  is  the  intercept  of  OU  and  PQ.  The  interaction  index  can  be  expressed  as  the  ratio  of  •  Panel  A  illustrates  the  case  for 


synergy  while  Panel  B  for  antagonism. 


estimated  curve  of  interaction  indices  versus  effects 
by  accounting  for  all  the  variabilities  in  estimating 
the  dose-effect  curves.  In  Section  4,  we  present  the 
results  from  simulations  and  case  studies  to  show  that 
our  proposed  procedure  performs  at  least  as  well  as 
the  Monte  Carlo  procedure  in  terms  of  covering  the 
underlying  curves  and  shortening  the  confidence  bound, 
and  performs  better  in  terms  of  taking  less  time  to 
compute.  The  last  section  is  devoted  to  discussion. 


2.  Interaction  Index  and  its  Confidence 
Interval  at  a  Combination  Dose 

In  this  section,  we  will  present  how  to  construct  a  con¬ 
fidence  interval  for  interaction  index  at  a  combination 
dose  (d\, ... ,  dk )  with  an  effect  y.  Here  the  dose-effect 
curve  for  drug  i  (i  =  l, ...  ,k)  is  estimated  from  the 
marginal  data  with  only  zth  drug  being  applied.  Note  that 
Chou  and  Talalay’s  median-effect  equation  (2)  also  can 
be  rewritten  as 

E 

log  - -  =  m  (log  d  -  log  Dm )  =  /?o  +  pi  log  d,  (3) 

1  —  E 


where  /?o  =  —m  log  Dm  and  p\  =  m.  The  dose  produc¬ 
ing  effect  E  can  be  written  as  either 


(  E  \m 

d  —  Dm  (  - - -  , 

(4) 

\l  -  E  J 

or 

"““pOC-e)  ■ 

(5) 

Suppose  model  (3)  has  the  form  log  y^ 

=  A,  + 

1  log  d  +  c  with  £  following  Af(0,  cr2),  then  we  may 
regress  log  y^—  on  log  d  to  get  the  marginal  dose-effect 

curve  log  y^  =  fio,i  +  fiijlogd  for  drug  i  with 
i  =  1  Meanwhile  we  may  get  the  variances 

and  covariances  for  the  estimates  fio,i  and  fi\j  for  i  = 
l, ...  ,k.  If  the  observed  mean  effect  at  a  combination 
dose  (d\, ... ,  dk)  is  y,  then,  based  on  (5),  the  associated 
interaction  index  can  be  estimated  by 

k  i  k  j 

f=z^=z — — — <«> 

i=1  y'1 

The  simulations  in  Section  4  indicate  that  the  distribution 
of  log(r)  is  approximately  normal,  while  f  deviates  from 
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a  normal  distribution  for  large  o ’s.  Thus,  we  should  ap¬ 
ply  the  delta  method  (Bickel  and  Doksum  2001)  to  log  r 
instead  of  r ,  then  we  take  the  exponential  transformation 
to  get  the  confidence  interval  for  r.  By  applying  delta 
method  to  log(f ),  we  get 


var(log(r)) 

-  var(f) 


^  1 1 

(  di 

di 

di 

di 

di 

“  f2' 

’  dfio ,k 

dfh.k 

’  dy 

xS  j 

^  di 

di 

di 

di 

di' 

- ~ - ?  •  • 

•  5  - Z -  , 

\ 

SPo,k 

dfil  ,k 

dy , 

,(7) 


where 


di  di  1  di  di  l°g  \-y  fio,i 

dfioj  ~  DyJ  ’  dfiU  -  DyJ  Pu 

for  i  =  l, ...  ,k,  and 

di  1  /  1  di  |  |  1  dk  \ 

dy  yO--y)\0i,iDyA  hkby,k)’ 

li  is  the  variance-covariance  matrix  of  the  2k  param¬ 
eters  (fio,x,  fix,x,  •  •  • ,  fio,k>  fix ,k)  and  the  observed  mean 
effect  y  at  (d\, ...  ,dk).  Any  two  pairs  of  parameters, 
0o,i,  fi\,i)  and  (fajJij)  when  i  /  j  are  indepen¬ 
dent  since  typically,  different  experimental  subjects  was 
used  for  drug  i  alone  and  for  drug  j  alone,  respectively. 
Further,  all  those  subjects  are  different  from  the  subjects 
administrated  the  combination  dose  (d\, ...  ,dk).  Thus, 
the  estimates  (fioj ,  fi\j  )  are  independent  of  the  estimates 
(fioj ,  fixj)  when  i  j ,  and  all  of  them  are  indepen¬ 

dent  of  the  observed  mean  effect  y  at  (d\ , . . . ,  dk).  There¬ 
fore,  £  is  a  block  diagonal  matrix  with  the  block  being  a 
2x2  matrix  except  for  the  last  diagonal  element  var(y). 
An  approximate  variance  of  log(f )  can  be  obtained  by 
var(log(f))  ±2  var(r),  where 


var(f)  ^ 


Ms:)' 


(var(/?0  i)  2cov(/?o,j>  A,(')(l°g  yzj  “  h,i) 

TfT+  ~  % 

var(/5i>(')(log  -  p0,i): 


+ 


*  )  , 

(1  d\  1  dk  \ 

P\,\  Dy<  1  P\  £  Dy,k  ) 

fe>)’ 


var(y). 


(8) 


We  can  estimate  var(y)  in  two  ways.  When  there  are 
replicates  at  the  combination  dose  (d\,  . . . ,  <4),  var(y) 
can  simply  be  estimated  by  the  sample  variance  at 
(d\,d2).  Otherwise,  we  may  borrow  the  information 
from  estimating  the  marginal  dose-effect  curves.  Note 
that  var(log  y^)  ^  (ypfzyy)2var(y),  thus,  we  may  sub¬ 
stitute  (^jz^y)2var(y)  by  the  average  of  the  squared 
residuals  obtained  from  fitting  the  median-effect  Equa¬ 
tion  (3)  for  all  drugs  involved  assuming  a  constant  vari¬ 
ance  for  both  the  single  and  combination  drug  effects. 
Once  the  variance  is  obtained,  a  (1  —  a)xl00%  confi¬ 
dence  interval  for  log(r)  can  be  formed  as 

[log(r)  -  ?„_2t;|\/var(log(f)), 

log(f)  +  tn_2k  a  >/var(log(f))J  , 

where  tn_2k  «  is  the  1  —  |  percentile  of  t -distribution  with 

n  —  2k  degree  of  freedom,  and  n  =  X/=i  ni  with  ni 
(i  =  1 , . . . ,  k)  being  the  number  of  observations  when 
drug  i  is  used  alone.  2k  is  the  total  number  of  estimated 
parameters  involved  in  estimating  the  interaction  index 
(6).  Thus,  a  (1  —  a)  x  100%  confidence  interval  for  r  can 
be  constructed  as: 

f  exp  «  \/var(log(r)))  , 

i  exp  Aar(log(f)))]  .  (9) 


When  var(log(f ))  is  small,  we  have 


r  exp 


(±tn-2k,Cv'dr(]og(iy)) 

-  2j 2  V var(f 


±2  z  exp 


-f  ±f„-2.,|Vvar(f). 


Therefore,  if  the  error  in  (3)  is  small,  the  confidence  in¬ 
terval  for  r  based  on  (9)  is  essentially  the  same  as  the 
confidence  interval  constructed  by  directly  applying  the 
delta  method  to  f ,  which  is 


I  ~  K-2 *,«Vvar(r),  f  VvaKfi]  •  (10) 

In  Section  4,  we  illustrate  that,  for  a  large  error  in  (3), 
the  confidence  interval  (9)  behaves  better  than  (10)  in 
two  aspects:  (i)  the  lower  limit  is  greater  than  zero  all 
the  time;  and  (ii)  the  confidence  interval  has  a  coverage 
rate  that  is  closer  to  the  nominal  rate.  Therefore,  the  con¬ 
fidence  interval  (9)  is  preferred.  When  n  —  2k  is  large,  say 
n  —  2k  >  20,  one  may  use  instead  of  tn_2k  „  in  esti¬ 
mating  the  confidence  intervals  (9)  and  (10),  where  z«  is 
the  1  —  |  percentile  of  the  standard  normal  distribution. 
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3.  Interaction  Indices  and  Their  Confidence 
Bound  at  a  Fixed  Ray 


3.1  Two-Drug  Combination 


Note  that  the  confidence  interval  (9)  is  based  on  a  sin¬ 
gle  observation  and  the  marginal  dose-effect  curves,  and 
the  estimated  interaction  index  and  its  confidence  interval 
are  greatly  influenced  by  this  single  observation.  Chou 
and  Talalay  (1984)  used  a  ray  design  to  assess  drug  in¬ 
teractions.  The  advantage  of  their  method  is  that  it  uses 
all  observations  with  the  component  doses  at  a  fixed  ray. 
We  adopt  the  interaction  index  instead  of  the  combination 
index  when  using  their  approach.  The  basic  idea  (Chou 
1991)  is  to  regress  log  on  log  D  for  each  of  the  two 
drugs  used  alone  and  regress  log  on  log(di  +  J2) 
for  the  combination  doses  (^1,^2)  with  ^  say, 

log  jzrp  =  0o,c  +  0i,c  log  Dc.  Then  for  each  fixed  effect 
y,  one  may  estimate  the  interaction  index  by 


f)  0Ji _ 

uy>ccoi+(a2 

Dy,  1 


f)  0J2 

Uy>ca>i+(D2 

Dy,  2 


(ID 


1 

where  Dy,i  =  exp(-|^)  (y^)^1’1  ,  Dy,2  = 

exp(“te)(*)*- and  =  exi,(“fc) 

Commercial  software  CalcuSyn  and  Com- 
puSyn  are  available  for  estimating  the  interaction  indices 
and  their  confidence  intervals.  The  confidence  intervals 
for  interaction  indices  in  (1 1)  were  constructed  based  on 
Monte  Carlo  techniques  and  the  normal  assumption  on 
the  parameters  (Belen’kii  and  Schinazi  1994).  Briefly, 
the  parameters,  (0oj,  0\j)  for  i  =  1,  2,  c,  and  the  inter¬ 
action  index  (11)  are  estimated  from  the  observed  data, 
then  certain  number  of  random  samples  (say,  500)  of  the 

parameters,  (0o,i,  0\,i)j  for  j  =  1,...,500  and  i  = 
1,  2,  c,  are  generated  based  on  their  estimated  values  and 
covariances  under  the  normal  assumption  on  each  pair 
of  these  parameters.  Thus,  500  interaction  indices,  zCT  . 
( j  =  1, . . . ,  500),  can  be  calculated  and  its  standard  devi¬ 
ation  can  be  estimated  as  j  ~  *ct)2- 

Consequently,  the  confidence  interval  can  be  constructed 
as  [zCT  -z«aT,  *ct  +  In  the  simulation  and  case 

studies  in  Section  4,  we  used  t  £  a  instead  of 

ni+n2+nc-6,  2 

since  the  number  of  observations  is  small. 

In  the  following  subsection,  we  extend  Chou  and  Ta- 
lalay’s  method  to  k(>  2)  drugs,  estimate  drug  interaction 
at  a  fixed  ray,  say  d\  :  :  •  •  •  :  dk  =  co\  :  CD2  :  •  •  •  :  cok , 

and  construct  a  (1  —  a)  x  100%  confidence  interval  for  the 
constructed  interaction  index  at  each  effect  y.  Thus,  by 
varying  y,  a  pointwise  confidence  bound  for  the  curve  of 


interaction  indices  versus  effects  with  combination  doses 
at  the  fixed  ray  can  be  constructed  by  using  the  delta 
method. 


3.2  A>drug  combination 

Again,  we  assume  that  the  fitted  dose-effect  curve 
is  log  =  0oj  +  fil, i  log  Di  +  6  for  drug  i  with 
i  =  1, . . . ,  k.  The  fitted  dose-effect  curve  for  the  mix¬ 
ture  with  their  component  doses  at  a  fixed  ray  with  d\  : 
d2  :■■■■■  dk  =  (o\  :  co2  :  •  •  •  :  cok  is  log  = 

0O,c  +  0\,c  log  Dc  +  c  with  Dc  =  d\  +  J2  +  •  •  •  +  dk . 
Then,  for  each  fixed  effect  y,  one  may  estimate  interac¬ 
tion  index  by 


D 


CO  1 


y>C  CD\Jr...+(Qk 


D 


(Ok 


D 


+  •••  + 


y>C  COl+...+CQk 


D 


(12) 


y,k 


where  Dyj  =  exP  ^°r  *  = 

1,  •  •  • ,  k,  c.  Again,  (0o,i,0l,i)  and  (0oj,0ij)  are  inde¬ 
pendent  as  long  as  i  /  j  for  i,  j  =  1, ,  k,  c.  Based 
on  the  delta  method  (Bickel  and  Doksum  2001),  we  can 
obtain  an  approximate  variance  for  zCT 


var(fcr)  = 


/  dzCT  \ 

_i_  ( cy_cj_  \ 

\dby,c) 


var  (Dyj) 


var  (DyiC) 


)  var(4 

“TV  (Thcoi)D2y.J 

;= 1  Dy,i  / 


)  (13) 


with 


var  (Dyj)  = 


SDyJ  8DyJ  \ 

\  dfio,i  ’  Sfh,i  ) 


'SDyJ 

sk),i 

|  dDyJ 


D: 


/ _ 1_  Am  -  !°g  -pj  \ 

''\  E’  Wj  ) 


1  -  ki 

x  *h>Ai  I  A 
ph 


for  i  =  c,  respectively.  Thus,  replacing 

var (Dy,i)  in  (13),  we  can  obtain  the  estimated  variance 
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for  ir 


var(fCT)  = 


■^-1  /  COi  Dy  e  \ 
i= 1  \(Sf=l  0)i)by,i  ) 


/ var(/?0  i )  2c°v(/?o ,i ,Pu) (log  ~  A),/ ) 

l7ir+ 


+ 


var(/?u)(log  f. zy  ~  A) 


/A, 


+ 


/  by£  ( \\ 

vz?=M£t  4,7/ 


(var(/?0  c)  2cov(/?o,c,  A,c)(log  yAy  -  A>,c) 

"1FT+ 


+- 


var(^i;C)(log  -py  -  Ao,c): 


Pi 


Pi 

) 


(14) 


(d\,  d2).  In  Section  3,  we  have  the  observations  for  com¬ 
bination  doses  at  a  fixed  ray,  then  we  fit  the  dose-effect 
curve  for  this  ray.  We  estimate  the  interaction  index  for 
each  fixed  effect,  where  the  combination  dose  produc¬ 
ing  such  an  effect  is  estimated.  Thus,  the  second  term  in 
(14)  describes  the  uncertainty  contributed  by  the  variance 
of  the  estimated  combination  dose  Dy,c,  which  could  be 
split  into  the  estimated  combination  dose 


4.  Simulations  and  Case  Studies 

4.1  Simulations 

To  examine  whether  the  confidence  intervals  proposed 
in  Sections  2  and  3  have  proper  characteristics,  we  per¬ 
formed  simulations  in  the  following  two  scenarios. 


Here,  we  prefer  the  confidence  interval  based  on  the 
delta  method  on  log(fcr)  since  log(fcr)  is  more  approx¬ 
imately  normally  distributed  than  iCT .  A  (1  —  a)  x  100% 
confidence  interval  for  zCT  can  be  constructed  by 


zCT  exp 


^  "+^  2*  2 ~2  yvar(fcr)^; 


zCT  exp 


(n+nc— 2k— 2,  y  j  ~~  ~  | 

- 7 - VVar(Tcr)  I 

ZCT  / 


(15) 


Again  n  =  2/=i  ni  and  ni  (l  =  1, . . . ,  k)  is  the  number 
of  observations  when  drug  i  is  used  alone,  nc  is  the  num¬ 
ber  of  observations  for  the  combination  doses  at  a  fixed 
ray.  By  varying  y  in  different  values,  we  can  construct  a 
pointwise  (1  —  a)  100%  confidence  bound  for  the  curve 
of  interaction  indices  versus  effects.  Thus,  we  can  assess 
drug  interactions  for  combination  doses  at  the  fixed  ray 
while  considering  the  stochastic  uncertainty  in  obtaining 
the  observations. 


Remark:  Comparing  the  variances  of  estimated  inter¬ 
action  indices  in  (8)  and  (14),  we  note  that  the  first  terms 
in  both  equations  are  approximately  the  same,  while  the 
second  terms  are  markedly  different.  In  Sections  2  and  3, 
the  k  dose-effect  curves  for  all  the  drugs  involved  are  es¬ 
timated.  The  first  terms  in  both  equations  describe  the  un¬ 
certainty  contributed  by  estimating  the  k  marginal  dose- 
effect  curves.  In  Section  2,  we  estimate  the  interaction  in¬ 
dex  based  on  the  observed  mean  effect  at  a  single  combi¬ 
nation  dose,  and  oftentimes,  we  assume  the  combination 
dose  is  measured  without  error.  Under  this  setting,  the 
second  term  in  (8)  describes  the  variability  contributed 
by  the  mean  of  the  observed  effects  at  the  combination 


Scenario  1:  three  drugs,  at  a  single  combination 
dose.  In  the  first  scenario,  we  simulated  three  drugs  that 
followed  the  median-effect  Equation  (2)  with  the  same 
slope  m  =  —  1  and  different  median  effective  doses: 
Dm  i  =  1,  Dm2  =  2,  and  Dm3  =  4,  respectively.  We 
took  the  combination  dose  (d\,d2,  d3)  with  each  com¬ 
ponent  being  one  third  of  its  associated  median  effec¬ 
tive  dose,  that  is,  (d\,d2,d3)  =  (^p-,  = 

(^,  |).  If  the  combination  dose  is  additive,  the  ex¬ 

pected  effect  will  be  0.5.  Let  us  denote  the  interaction 
index  at  this  combination  dose  as  r,  the  effect  as  E ,  then 
based  on  Equation  (4),  we  have 

d\ _ ^ _ d2 _ ^ _ d3  _  ^ 

Dmi  (yz^)  Dm2  (yz^)  m  Dm3  (y^) 

Thus,  the  effect  at  ( d\  ,d2,d3 )  can  be  explicitly  expressed 
as 

_  (T_1  +  Imfi) 

iWr-i  TjEfjEfjEvf' 

1  ^  y  y  Dm  i  ^  Dm2  ^  Dm3  ) ) 

We  vary  r  among  (0.2,  0.4,  0.6,  0.8,  1,  1.25,  1.67,  2.5,  5). 
The  corresponding  effect  E  will  be  (0.167,  0.286,  0.375, 
0.444, 0.5, 0.556, 0.625, 0.714, 0.833),  respectively.  Note 
that  the  slope  m  is  negative,  so  the  dose-effect  curve  is 
decreasing.  If  the  effect  is  less  than  0.5,  then  the  combi¬ 
nation  dose  will  be  synergistic  and  the  interaction  index 
will  be  less  than  1 ;  whereas  if  the  effect  is  greater  than 
0.5,  the  combination  dose  will  be  antagonistic  and  the  in¬ 
teraction  index  will  be  greater  than  1 .  It  is  obvious  that 
the  farther  the  interaction  index  moves  away  from  1 ,  the 
stronger  is  the  interaction  effect. 
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Table  1.  Simulation  results  for  Scenario  1:  a  fixed  combination  dose  (d\,d,2,  d^)  =  (y ,  |)  but  with  varying  interaction  indices.  The  three 

dose-effect  curves  follow  the  median-effect  equation  with  m  =  —  1  and  (Dm Dm2,  Dm 3)  =  (1,  2,  4). 


T 

0.2 

0.4 

0.6 

0.8 

1 

1.25 

1.67 

2.5 

5 

mean(f) 

0.202 

0.402 

0.601 

0.804 

1.001 

1.266 

1.676 

2.503 

5.022 

Cov.rate 

0.950 

0.952 

0.946 

0.947 

0.939 

0.958 

0.951 

0.947 

0.948 

Cov.rate.log 

0.954 

0.955 

0.950 

0.950 

0.941 

0.953 

0.951 

0.951 

0.954 

G  =0.1 

Len.ci 

0.093 

0.180 

0.266 

0.353 

0.439 

0.555 

0.737 

1.109 

2.276 

Len.ci.log 

0.094 

0.181 

0.268 

0.357 

0.443 

0.560 

0.744 

1.119 

2.298 

Pct.syn.log 

100 

100 

99.4 

51.7 

3.4 

0 

0 

0 

0 

Pct.add.log 

0 

0 

0.6 

48.2 

94.1 

46.4 

0.2 

0 

0 

Pct.ant.log 

0 

0 

0 

0.1 

2.5 

53.6 

99.8 

100 

100 

mean(f) 

0.220 

0.435 

0.644 

0.879 

1.080 

1.419 

1.850 

2.753 

5.664 

Cov.rate 

0.930 

0.933 

0.918 

0.923 

0.916 

0.945 

0.941 

0.920 

0.931 

Cov.rate.log 

0.956 

0.959 

0.952 

0.952 

0.945 

0.956 

0.953 

0.952 

0.956 

II 

O 

Len.ci 

0.397 

0.777 

1.151 

1.562 

1.926 

2.552 

3.381 

5.108 

10.966 

Len.ci.log 

0.463 

0.901 

1.336 

1.812 

2.237 

2.971 

3.972 

6.018 

13.122 

Pct.syn.log 

94.6 

52.3 

20.8 

6.5 

3.1 

0.5 

0.1 

0 

0 

Pct.add.log 

5.4 

47.7 

79.2 

92.2 

94.5 

91.1 

79.2 

48.4 

5.9 

Pct.ant.log 

0 

0 

0 

1.3 

2.4 

8.4 

20.7 

51.6 

94.1 

Under  the  above  setting,  we  first  generated  six  equally 
spaced  doses,  ranging  from  0.1  to  three-fold  of  the  as¬ 
sociated  median  effective  dose  for  each  drug.  We  then 
generated  the  effects  based  on  the  model  log  = 
/?0  +  /?i  lo gd  +  e  with  e  ~  N(0,  a2)  for  each  drug, 
where  fio  =  —m  log  Dm  and  =  m.  We  generated  the 
observed  effect  at  the  combination  dose  (y ,  |)  with 

the  same  size  of  the  stochastic  variation.  The  total  sam¬ 
ple  size  for  each  simulated  experiment  was  19  (six  ob¬ 
servations  for  each  single  drug  and  one  observation  for 
the  combination  dose  effect).  We  fitted  each  dose-effect 
curve  based  on  the  generated  data.  Then,  for  each  r ,  we 
estimated  the  interaction  index  based  on  (6),  constructed 
its  95%  confidence  intervals  based  on  (9)  and  (10),  re¬ 
spectively,  calculated  the  length  of  the  confidence  inter¬ 
vals,  and  counted  whether  the  true  r  lies  in  the  respective 
confidence  intervals  based  on  (9)  and  (10),  and  whether 
the  confidence  interval  based  on  (9)  lies  below  1,  con¬ 
tains  1,  or  lies  above  1.  We  repeated  this  procedure  1,000 
times,  and  averaged  all  the  above  quantities.  We  summa¬ 
rized  the  results  in  Table  1  under  different  settings  for 
a\  a  =  0.1  and  g  —  0.4,  respectively.  From  Table  1, 
we  conclude  that  (a)  the  estimation  for  r  (mean(f)  in 
Table  1)  is  close  to  the  true  value  and  the  accuracy  de¬ 
creases  as  g  increases;  (b)  the  resulting  coverage  rates 
(Cov.rate.log)  based  on  confidence  interval  (9)  are  closer 
to  the  nominal  coverage  rate  of  95%  than  those  (Cov.rate) 
based  on  (10),  particularly,  when  g  is  larger;  (c)  the  aver¬ 
age  lengths  of  the  confidence  intervals  (Len.ci.log)  based 
on  (9)  and  the  average  lengths  of  the  confidence  inter¬ 
vals  (Len.ci)  based  on  (10)  increase  as  g  increases  with 
Len.ci.log  slightly  larger  than  Len.ci  to  provide  the  nom¬ 


inal  coverage  rate;  and  (d)  the  percentage  of  times  the 
model  correctly  assesses  drug  interaction  based  on  (9) 
as  synergy  (Pct.syn.log),  additivity  (Pct.add.log),  or  an¬ 
tagonism  (Pct.ant.log)  decreases  as  g  increases.  For  each 
underlying  interaction  index  among  (0.2,  0.6,  1,  1.67,  5), 
we  obtained  the  Q-Q  plot  of  the  1,000  estimated  interac¬ 
tion  indices  (Figure  2,  Columns  B1  and  B3)  as  well  as 
the  Q-Q  plot  of  the  1,000  logarithms  of  the  estimated  in¬ 
teraction  indices  (Figure  2,  Columns  B2  and  B4)  under 
the  settings  g  =  0.1  and  g  =  0.4,  respectively.  From 
Figure  2,  it  is  clear  that  for  small  g  (e.g.,  g  =  0.1  ),  both 
the  estimated  interaction  index  and  the  logarithm  of  the 
estimated  interaction  index  are  approximately  normally 
distributed.  But,  when  g  becomes  large,  say  g  =  0.4,  the 
estimated  interaction  indices  deviate  from  a  normal  dis¬ 
tribution,  while  the  logarithms  of  the  estimated  interac¬ 
tion  indices  are  still  approximately  normally  distributed. 
Therefore,  one  would  expect  that  the  delta  method  on  the 
logarithm  of  the  interaction  index  would  work  better  for 
constructing  confidence  intervals  for  interaction  indices. 
This  assertion  has  been  verified  by  the  results  of  the  cur¬ 
rent  simulation  studies.  Thus,  we  prefer  using  confidence 
interval  (9)  over  (10)  for  the  interaction  index  in  Section 
2,  and  using  the  confidence  interval  (15)  in  Section  3. 

Scenario  2:  Two  drugs,  with  a  ray  design.  The  sec¬ 
ond  scenario  involves  two  drugs  that  have  the  same  dose- 
effect  curves  as  drug  1  and  drug  2  in  the  first  scenario. 
That  is,  the  two  dose-effect  curves  follow  the  median- 
effect  Equation  (2)  with  the  same  slope  m  =  —  1,  and 
median  effective  doses:  Dm\  =  1  and  Dm2  =  2,  re¬ 
spectively.  We  assume  that  the  dose-effect  curve  for  the 


10 


Confidence  Intervals  of  Interaction  Index  for  Assessing  Multiple  Drug  Interaction 


A1 :  11=0.2 
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Figure  2.  Q-Q  plots  of  estimated  interaction  indices  (Column  B1  and  B3)  and  Q-Q  plots  of  the  logarithms  of  the  estimated  interaction  indices 
(Column  B2  and  B4)  for  1000  samples  under  Scenario  1.  Columns  B1  and  B2  show  the  Q-Q  plots  under  o  =0.1,  and  Column  B3  and  B4  show 
the  Q-Q  plots  under  o  =  0.4. 


combination  doses  ( d\,di )  at  the  fixed  ray,  say,  ^  =  y, 
follows  the  median-effect  Equation  (2)  with  Dm  12  =  1.5 
and  myi  —  —2. 

We  generated  five  equally  spaced  doses,  ranging  from 
0.1  to  three-fold  of  the  associated  median  effective  dose 
for  each  of  the  single  drug,  and  five  equally  spaced  doses, 
ranging  from  0.5  to  three-fold  of  the  associated  median 
effective  dose  for  the  mixture  (d\ ,  J2)  at  the  fixed  ray 
with  ^  =  y  anc*  with  the  dose  in  the  median  effect  Equa¬ 
tion  (2)  being  d\  +  ^2-  We  then  generated  the  effects  as¬ 
sociated  with  these  generated  doses  based  on  the  model 
log  \~p  =  A)  +  fil  log d  +  £  with  £  ~  Af(0,  a2)  for  the 
two  drugs  and  their  mixture  under  the  settings  a  =  0.2 
and  a  =0.4,  respectively.  The  total  sample  size  for  each 
simulated  experiment  is  15  (five  observations  for  each 


single  drug  and  five  observations  for  combination  doses). 
In  addition,  under  each  setting  for  a ,  we  generated  seven 
samples  for  illustration.  For  each  sample,  we  first  fitted 
the  dose-effect  curves  for  the  two  drugs  and  the  mix¬ 
ture,  and  then  performed  the  following  steps:  we  (a)  esti¬ 
mated  the  interaction  indices  based  on  (12)  for  42  equally 
spaced  effect  levels  between  the  range  of  0.1  to  0.95; 
(b)  constructed  their  confidence  intervals  based  on  (15) 
and  on  a  Monte  Carlo  simulation  proposed  by  Belen’kii 
and  Schinazi  (1994),  respectively;  and  (c)  estimated  in¬ 
teraction  index  (6)  and  constructed  the  confidence  inter¬ 
val  (9)  for  each  observed  combination  dose.  Figure  3  il¬ 
lustrates  plots  of  the  underlying  curve  of  the  interaction 
index  versus  effect  (solid  line),  the  pointwise  95%  confi¬ 
dence  bound  for  this  curve  based  on  (15)  (dotted  lines), 


II 


Statistics  in  Biopharmaceutical  Research:  Vol.  1 ,  No.  1 


the  confidence  bound  based  on  the  Monte  Carlo  simula¬ 
tion  (dashed  lines),  and  the  estimated  interaction  indices 
(dots)  and  their  confidence  intervals  (vertical  bars)  for 
the  observed  doses  based  on  (9)  for  the  seven  illustrative 
samples  under  each  setting  for  a .  From  Figure  3,  we  con¬ 
clude  that  (a)  the  pointwise  95%  confidence  bound  (dot¬ 
ted  lines)  embrace  the  true  curve  (solid  line)  well;  (b)  the 
pointwise  confidence  bounds  based  on  (15)  (dotted  lines) 
are  similar  to  those  based  on  Monte  Carlo  simulations 
(dashed  lines)  when  a  is  small,  but  perform  better  when 
o  is  large;  and  (c)  the  confidence  intervals  based  on  sin¬ 
gle  observations  (vertical  bars)  are  generally  wider  than 
both  confidence  bounds,  and  the  conclusions  based  on 
confidence  bounds  are  more  accurate  than  those  based  on 
the  confidence  intervals  (vertical  bars)  for  single  observa¬ 
tions.  In  addition,  for  each  sample,  we  calculated  the  ratio 
of  the  length  of  the  confidence  interval  based  on  (9)  and 
the  confidence  interval  based  on  Monte  Carlo  simulation 
described  in  Section  3  for  each  of  the  observed  effects  at 
the  combination  doses  at  the  fixed  ray.  Under  o  —  0.2, 
the  ratios  for  the  seven  samples  have  a  mean  2.05  with 
standard  deviation  0.58,  and  range  from  1.12  to  3.28.  Un¬ 
der  a  =0.4,  the  ratios  for  the  seven  samples  have  a  mean 
2.33  with  standard  deviation  0.92,  and  range  from  1.01  to 
4.15  after  removing  an  extreme  of  the  observation  with 
effect  close  to  one  (Figure  3,  Panel  B7).  Therefore,  when 
several  observations  for  combination  doses  at  a  fixed  ray 
are  available,  the  confidence  bound  derived  in  Section  3 
using  more  available  information  is  more  efficient,  thus 
preferred.  In  addition,  when  we  ran  this  simulation  to  get 
the  14  confidence  bounds  in  the  14  panels  in  Figure  3 
by  separately  using  the  confidence  interval  based  on  (15) 
and  Monte  Carlo  procedure  on  an  Intel  1.83  GHz  com¬ 
puter,  the  time  it  took  was  10  seconds  and  17  minutes, 
respectively.  It  is  clear  that  the  calculation  for  the  confi¬ 
dence  bound  based  on  (15)  is  much  faster  than  that  based 
on  Monte  Carlo  simulations. 

4.2  Case  Studies 

Our  research  group  (Lee  et  al.  2007;  Kong  and  Lee 
2006)  investigated  drug  interactions  between  two  novel 
agents,  SCH66336  and  4-HPR,  in  a  number  of  squamous 
cell  carcinoma  cell  lines  (Chun  et  al.  2003).  Here  we 
present  the  dataset  and  results  from  cell  line  UMSCC22B 
in  Table  2  for  investigating  drug  interaction  in  combina¬ 
tion  doses  at  the  fixed  ray  with  ^  =  j. 

We  first  obtained  the  dose-effect  curves  for  SCH66336 
and  4-HPR  by  a  linear  regression  of  log  y^  on  log  d , 
based  on  the  data  in  Table  2.  Recall  that  log  y^  = 
m(\ogd  —  1  ogDm)  =  T  lo gd.  The  estimates  of 
/?0,  Dm ,  and  d  for  drug  1,  drug  2,  and  the  mixture  of 
the  drugs  with  equal  concentrations  are  summarized  in 
the  same  table. 


The  transformed  data  log  y^  versus  log  d  and  the 
median-effect  plots  are  shown  in  Figure  4(A).  This  me¬ 
dian  effect  plot  indicates  that  the  data  follow  the  median- 
effect  Equation  (2)  reasonably  well.  Based  on  the  fit¬ 
ted  median-effect  equations,  we  calculated  the  interac¬ 
tion  indices  based  on  (12)  for  varied  effects  for  combina¬ 
tion  doses  at  the  fixed  ray  with  ^  =  y  and  constructed 
their  associated  confidence  bounds  based  on  (15)  and  on 
Monte  Carlo  simulations  (Belen’kii  and  Schinazi  1994), 
respectively.  Figure  4(B)  shows  the  plot  of  the  interac¬ 
tion  indices  (on  the  logarithm  scale)  versus  effects  (solid 
line)  for  combination  doses  at  this  fixed  ray  and  the  95% 
pointwise  confidence  bounds  based  on  (15)  (dotted  lines) 
and  on  Monte  Carlo  simulations  (dashed  lines).  Based 
on  the  confidence  bound  (dotted  line),  we  conclude  that 
the  combination  doses  at  the  fixed  ray  with  ^  =  y  with 
effect  less  than  0.52  are  synergistic,  and  the  combina¬ 
tion  doses  at  the  fixed  ray  with  effect  greater  than  0.52 
are  additive.  The  conclusions  based  on  the  confidence 
bounds  obtained  from  Monte  Carlo  simulations  (dashed 
lines)  are  slightly  different.  We  also  calculated  four  inter¬ 
action  indices  based  on  (6)  and  their  confidence  intervals 
based  on  (9)  at  the  four  observed  data  points  (d\,  J2)  as 
being  (0.1,  0.1),  (0.5,  0.5),  (1,  1),  and  (2,  2).  The  four 
interaction  indices  were  0.791,  0.609,  0.256,  and  0.103, 
and  their  corresponding  95%  confidence  intervals  were 
[0.202,  3.091],  [0.169,  2.193],  [0.060, 1.087],  and  [0.018, 
0.581],  respectively.  These  pairs  of  interaction  indices 
versus  effects,  along  with  their  95%  confidence  intervals 
are  shown  as  vertical  bars  in  Figure  4(B).  From  these 
four  interaction  indices  and  their  confidence  intervals,  we 
conclude  that  the  combination  doses  at  the  fixed  ray  with 
^  =  |  are  synergistic  for  doses  >  2  //M  for  each  single 
drug,  and  additive  for  doses  <  1  //M  for  each  single  drug. 
The  conclusions  from  the  two  procedures  in  Section  2 
and  3  are  slightly  different:  the  combination  doses  (1,1) 
and  (0.5,  0.5)  with  respective  observed  effect  0.3551  and 
0.4919  were  identified  as  additive  based  on  the  second 
and  third  vertical  bar  (reading  from  left  to  right),  while 
based  on  the  confidence  bound  (dotted  lines),  the  combi¬ 
nation  doses  were  identified  as  synergistic  as  each  effect 
was  less  than  0.52.  The  relative  length  of  the  confidence 
interval  based  on  (9)  versus  the  monte  Carlo  confidence 
interval  ranges  from  1.27  to  3.72  for  the  four  combination 
doses  at  the  fixed  ray.  Once  again,  this  example  shows 
that  the  confidence  interval  estimation  based  on  a  sin¬ 
gle  observation  (9)  is  not  as  efficient  as  the  correspond¬ 
ing  confidence  interval  based  on  model  (15)  which  used 
more  data. 

We  also  examined  another  dataset  from  a  drug  combi- 
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A  4  :  Effect  E  A  5  :  Effect  E 


A  7  :  Effect  E 


B  6  :  Effect  E 


B  7  :  Effect  E 


Figure  3.  Simulated  results  under  Scenario  2.  The  solid  line  is  the  plot  of  the  underlying  true  interaction  indices  versus  effects  for  combination 
doses  at  the  fixed  ray  with  ^  =  j  ,  the  dotted  lines  and  the  dashed  lines  are  the  95%  pointwise  confidence  bounds  for  the  curve  of  the  interaction 
index  versus  effect  based  on  the  delta  method  and  Monte  Carlo  simulation  under  the  settings  o  =  0.2  (Panels  A)  and  o  =  0.4  (Panel  B), 
respectively,  and  the  dots  and  the  vertical  bars  are  the  estimated  interaction  indices  and  their  confidence  intervals  for  observed  combinations. 


nation  study  for  o-phenanthroline  and  ADP  on  the  inhibi¬ 
tion  of  horse  liver  alcohol  dehydrogenase,  which  was  an¬ 
alyzed  by  Chou  and  Talalay  (1984)  and  by  Belen’kii  and 
Schinazi  (1994).  The  dataset  and  the  estimated  median- 
effect  parameters  are  shown  in  Table  3.  The  median- 
effect  plots  for  the  two  drugs  and  their  mixture  ( d\ ,  J2) 
at  the  fixed  ray  with  ^  are  shown  in  Figure  5(A). 

The  plot  of  interaction  indices  versus  the  fractional  in¬ 
hibitions  at  this  fixed  ray  are  shown  as  a  solid  line  in 
Figure  5(B).  In  the  same  panel,  we  illustrate  the  point- 
wise  confidence  bound  (dotted  lines)  based  on  (15)  and 
the  pointwise  confidence  bound  (dashed  line)  based  on 


Monte  Carlo  simulations  for  this  curve,  and  the  estimated 
interaction  indices  (dots)  and  their  associated  confidence 
intervals  (vertical  bars)  based  on  (9)  for  the  combination 
doses  having  observed  effects.  Again,  the  two  95%  con¬ 
fidence  bounds  are  almost  the  same,  the  vertical  bars  are 
wider  than  the  confidence  bound,  and  the  conclusions  on 
drug  interactions  based  on  vertical  bars  and  those  based 
on  confidence  bounds  are  consistent. 

We  developed  two  S-PLUS/R  programs.  One  is  used 
to  estimate  the  interaction  index  and  its  confidence 
interval  for  a  single  combination  dose  of  multiple 
drugs,  and  the  other  is  used  to  estimate  the  pointwise 
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Table  2.  Fractions  of  squamous  cell  carcinoma  cells  (UMSCC22B)  surviving  after  72  hours  of  treatment  by  single  and  combination  dose  levels 
of  SCH66336  and  4-HPR  and  the  fitted  median-effect  parameters. 


SCH66336 
dose  (/zM) 

4-HPR  dose 
dose  (/iM) 

Fractional 

survival 

Median-effect 

parameters 

0.1 

0.6701 

fi0fl  =  0.094(0.085) 

0.5 

0.6289 

fil’l  =  -0.335(0.066) 

1 

0.5577 

Dm\  =  1.326 

2 

0.4550 

b  i  =  0.187 

4 

0.3755 

0.1 

0.7666 

A), 2  =  0.217(0.073) 

0.5 

0.5833 

Plt2  =  -0.398(0.058) 

1 

0.5706 

Dm2  =  1.726 

2 

0.4934 

<r2  =  0.129 

0.1 

0.1 

0.6539 

A), 12  =  -0.225(0.092) 

0.5 

0.5 

0.4919 

h,Yl  =  -0.596(0.082) 

1 

1 

0.3551 

Dm\  2  =  0.686 

2 

2 

0.2341 

b\2  =  0.182 

Note:  The  number  inside  the  parentheses  in  the  last  column  is  the  standard  error  of  the  estimate. 


A.  Median  Effect  Plots  B.  Inter.  Index  vs.  Effect 


the  two  dashed  lines  are  the  pointwise  95%  confidence  bounds  for  the  curve  of  interaction  index  versus  effect  based  on  the  delta  method  in  Section 
3  and  Monte  Carlo  simulation,  respectively,  and  the  dots  and  the  vertical  bars  are  the  estimated  interaction  indices  and  their  confidence  intervals 
for  observed  combinations.  The  four  vertical  bars  from  left  to  right  correspond  to  the  combination  doses  of  (2,  2),  (1,  1),  (0.5,  0.5),  and  (0.1,  0.1), 
respectively. 


14 


Confidence  Intervals  of  Interaction  Index  for  Assessing  Multiple  Drug  Interaction 


Table  3.  Inhibition  of  horse  liver  alcohol  dehydrogenase  by  o-phenanthroline  and  ADP  alone  and  in  combination  (Chou  and  Talalay  1984; 
Belen’kii  and  Schinazi  1994). 


Fractional 

Median-effect 

o-phenanthroline 

ADP 

Inhibition 

parameters 

8.7 

0.132 

fio  ti  =  -4.696(0.145) 

17.4 

0.267 

=  1.302(0.046) 

26.1 

0.411 

Dmi  =  36.803 

34.8 

0.476 

<j\  =  0.058 

43.5 

0.548 

0.5 

0.175 

fi0i2  =  -0.601(0.079) 

1.0 

0.400 

Pl,2  =  1.178(0.127) 

1.5 

0.492 

Dm2  —  1-666 

2.0 

0.542 

a2  =0.161 

2.5 

0.592 

9.2x|L4 

9.2  x 

0.507 

A), 12  =  -3.843(0.038) 

18.4  x 

18.4  Xjij 

0.769 

^1,12  =  1-739(0.012) 

v  17.4 

27,6  m 

27.6  x  jj|^ 

0.872 

Dm\ 2  =9.117 

36.8  X  ||| 
46-°XTO 

36.8  x 

460  x  Tih 

0.919 

0.944 

b\2  =  0.015 

Note:  The  number  inside  the  parentheses  in  the  last  column  is  the  standard  error  of  the  estimate. 


confidence  bound  for  the  curve  of  interaction  index 
versus  effect  for  combination  doses  at  a  fixed  ray.  The 
S-PLUS/R  code  and  the  data  example  are  available  in 
CI-ofJnte r action  Jndex,  which  can  be  downloaded  from 
http:// bio  statistics,  mdanderson.  org/SoftwareDownload/. 

5.  Discussion 

We  proposed  a  procedure  in  Section  2  to  estimate  the 
interaction  index  and  constructed  its  associated  confi¬ 
dence  interval  for  a  multiple  drug  combination.  In  most 
cases,  the  dose-effect  for  a  single  agent  is  known,  and  in¬ 
vestigators  are  interested  in  assessing  whether  drug  com¬ 
binations  are  synergistic.  When  resources  are  limited,  the 
experiment  can  be  conducted  in  only  a  limited  number 
of  combination  doses.  We  can  assess  drug  interactions 
for  those  combination  doses  based  on  the  procedure  pro¬ 
vided  in  Section  2.  Note  that  although  the  dose-effect 
curves  follow  Chou  and  Talalay ’s  median-effect  equation 
work  reasonably  well,  the  model  may  not  work  in  cer¬ 
tain  cases.  In  these  cases,  other  dose-effect  models  must 
be  sought.  For  example,  Lee  et  al.  (2009)  found  that  the 
Emax  model  describes  the  experimental  data  there  bet¬ 
ter  than  Chou  and  Talalay’s  median-effect  equation;  thus, 
the  £max  model  was  used  there.  Upon  finding  the  dose- 
effect  curves  of  any  parametric  form  which  fits  the  data, 
one  may  use  the  same  philosophy  to  estimate  the  interac¬ 
tion  index  and  construct  its  associated  confidence  inter¬ 
val  based  on  the  delta  method.  However,  using  this  “at  a 


combination  dose”  method,  one  can  assess  drug  interac¬ 
tions  only  at  combination  doses  having  observed  effects, 
and  the  drug  interaction  tends  to  be  predicted  as  additiv¬ 
ity  due  to  lack  of  efficiency  (i.e.,  wide  confidence  inter¬ 
vals)  even  with  nonadditive  drug  interactions. 

Chou  and  Talalay’s  method  based  on  a  ray  design  is 
widely  used.  We  provided  a  procedure  to  construct  point- 
wise  confidence  bound  for  Chou  and  Talalay’s  curve  of 
interaction  index  versus  effect  in  Section  3.  The  proce¬ 
dure  we  provided  avoids  extensive  calculations  used  in 
Monte  Carlo  techniques,  which  were  required  in  the  soft¬ 
ware  CalcuSyn  and  CompuSyn  and  in  the  method  pro¬ 
vided  by  Belen’kii  and  Schinazi  (1994).  From  the  sim¬ 
ulations  and  case  studies  in  Section  4,  we  find  that  the 
confidence  bounds  provided  in  Section  3  are  at  least  as 
good  as  the  confidence  bounds  constructed  using  Monte 
Carlo  techniques,  while  the  confidence  bounds  in  Sec¬ 
tion  3  are  much  faster  to  compute.  Our  limited  simula¬ 
tion  studies  also  show  that  the  approximation  based  on 
the  logarithm  transformation  and  t- statistic  works  rea¬ 
sonably  well  when  sample  size  was  as  low  as  19  in  one 
case  and  15  in  another  case. 

From  simulations  and  case  studies  in  Section  4,  it  is 
clear  that  the  confidence  intervals  based  on  single  obser¬ 
vations  (verticals  bars  in  Figures  3,  4,  and  5)  are  wider 
than  the  pointwise  confidence  bounds  based  on  a  ray  de¬ 
sign  which  use  more  data.  In  a  ray  design,  the  constructed 
confidence  bound  used  all  the  information  on  this  ray, 
therefore,  the  estimates  based  on  Section  3  will  be  more 
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A.  Median  Effect  Plots  B.  Inter.  Index  vs.  Effect 


Figure  5.  Median-effect  plots  (Panel  A)  and  the  plot  of  interaction  indices  versus  effects  (Panel  B)  for  the  combination  doses  at  the  fixed  ray 
with  ^  for  o-phenanthroline  and  ADP.  In  Panel  B,  the  solid  line  is  the  plot  of  the  estimated  interaction  indices  versus  effects,  the  two 

dotted  lines  and  the  two  dashed  lines  are  the  pointwise  confidence  bounds  for  the  curve  of  interaction  index  versus  effect  based  on  the  delta  method 
and  Monte  Carlo  simulation,  respectively,  and  the  dots  and  the  vertical  bars  are  the  estimated  interaction  indices  and  their  confidence  intervals  for 
observed  combinations. 


efficient  and  more  accurate.  By  examining  the  curve  of 
interaction  indices  versus  effects  on  several  rays  and  ex¬ 
amining  their  associated  confidence  bounds,  one  may  ob¬ 
tain  an  overall  picture  of  the  drug  interactions.  The  lim¬ 
itation  is  that  one  can  only  assess  drug  interactions  for 
combination  doses  at  these  fixed  examined  rays.  When 
a  factorial  design  or  a  uniform  design  (Tan,  Fang,  Tian, 
and  Houghton  2003)  is  used,  a  good  strategy  is  to  use  re¬ 
sponse  surface  models,  which  use  all  the  information  pre¬ 
sented  in  the  observed  data.  We  have  proposed  a  gener¬ 
alized  response  surface  model  (Kong  and  Lee  2006)  and 
a  semiparametric  model  (Kong  and  Lee  2007)  to  capture 
drug  interaction  for  all  combination  doses.  However,  re¬ 
sponse  surface  models  for  more  than  three  drugs  are  dif¬ 
ficult  to  construct.  Therefore,  to  assess  drug  interactions 
among  multiple  drugs,  the  directly  calculated  interaction 
index  and  the  plots  of  interaction  indices  versus  effects  at 
several  fixed  rays  are  still  feasible  and  remain  appealing 
methods  to  use.  The  confidence  intervals  we  provided  in 
Sections  2  and  3  are  easy  to  calculate,  and  have  a  de¬ 
sirable  coverage  rate.  Hence,  it  suggests  that  it  is  not 
necessary  to  run  extensive  Monte  Carlo  simulations  for 
obtaining  these  confidence  intervals.  Based  on  the  result 
of  this  article,  Lee  et  al.  (2009)  constructed  the  simulta¬ 
neous  confidence  interval  for  interaction  indices  over  a 
range  of  treatment  effects.  The  simultaneous  confidence 
interval  is  also  easy  to  calculate  but  is  more  conservative. 
The  proposed  confidence  intervals  can  help  us  to  gauge 
the  uncertainties  of  the  interaction  indices  for  combina¬ 


tion  doses  for  two  or  more  drugs  and  can  also  be  used  to 
provide  more  in-depth  assessment  for  drug  interactions. 
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Cure  Rate  Model  With  Mismeasured  Covariates 

Under  Transformation 

Yanyuan  Ma  and  Guosheng  Yin 


Cure  rate  models  explicitly  account  for  the  survival  fraction  in  failure  time  data.  When  the  covariates  are  measured  with  errors,  naively 
treating  mismeasured  covariates  as  error-free  would  cause  estimation  bias  and  thus  lead  to  incorrect  inference.  Under  the  proportional 
hazards  cure  model,  we  propose  a  corrected  score  approach  as  well  as  its  generalization,  and  implement  a  transformation  on  the  mismeasured 
covariates  toward  error  additivity  and/or  normality.  The  corrected  score  equations  can  be  easily  solved  through  the  backfitting  procedure, 
and  the  biases  in  the  parameter  estimates  are  successfully  eliminated.  We  show  that  the  proposed  estimators  for  the  regression  coefficients 
are  consistent  and  asymptotically  normal.  We  conduct  simulation  studies  to  examine  the  finite-sample  properties  of  the  new  method  and 
apply  it  to  a  real  data  set  for  illustration. 

KEY  WORDS:  Cure  model;  Errors-in- variables  problem;  Proportional  hazards  model;  Semiparametric  method;  Survival  fraction. 


1.  INTRODUCTION 

In  oncology  studies,  it  is  often  observed  that  a  certain  per¬ 
centage  of  subjects  are  either  cured  following  treatment  or  are 
unsusceptible  to  the  event  of  interest  and  thus  will  never  ex¬ 
perience  the  failure  (e.g.,  disease  relapse).  To  explicitly  incor¬ 
porate  the  survival  fraction  for  such  data,  cure  rate  models 
have  been  proposed  and  extensively  investigated.  The  two- 
component  mixture  cure  model  (Berkson  and  Gage  1952)  nat¬ 
urally  separates  the  entire  population  into  cured  and  noncured 
subjects, 

S(t |X)  =  0(X)  +  {1  -  0(X)}S* 0|X), 

where  0(X)  is  the  cure  probability  and  S*(t\X)  is  a  proper  sur¬ 
vival  function  for  the  uncured  population,  that  is,  lim^oo  S*(t  \ 
X)  =  0.  Its  intuitive  structure  and  ease  of  interpretation  has 
made  this  mixture  cure  model  the  focus  of  much  attention  (see, 
e.g.,  Gray  and  Tsiatis  1989;  Sposto,  Sather,  and  Baker  1992; 
Laska  and  Meisner  1992;  Kuk  and  Chen  1992;  Mailer  and  Zhou 
1996;  Sy  and  Taylor  2000;  Lu  and  Ying  2004;  Li,  Tiwari,  and 
Guha  2007).  But  the  mixture  cure  model  lacks  certain  desirable 
properties,  as  pointed  out  by  Chen,  Ibrahim,  and  Sinha  (1999). 
Moreover,  the  numerical  computation  can  be  quite  challenging 
due  to  the  additive  structure  of  the  cured  and  uncured  compo¬ 
nents. 

Alternatively,  the  proportional  hazards  cure  rate  model  devel¬ 
oped  by  Yakovlev  and  Tsodikov  (1996)  and  Tsodikov  (1998a) 
integrates  the  survival  times  of  the  cured  and  noncured  subjects 
into  one  single  formulation  of  the  survival  function, 

S(f|X)  =  exp{-0(X)F(f)},  (1) 

where  0(X)  is  a  known  link  function  and  F(t)  is  an  un¬ 
known  baseline  cumulative  distribution  function  (cdf).  The  cor¬ 
responding  cure  rate  is  5(oo|X)  =  exp{— 0(X)},  and  the  haz¬ 
ard  function  is  A(/|X)  =  0(X)/(f),  where  f(t)  =  dF (t) / dt . 
When  0(X)  =  exp(Xr/3)  and  contains  an  intercept  b ,  model 
(1)  becomes  the  usual  Cox  proportional  hazards  model  (Cox 
1972)  subject  to  the  restriction  of  a  bounded  baseline  cumu¬ 
lative  hazard  function,  given  by  A o(0  =  F(t)  exp(Z?).  Thus  a 
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cure  rate  model  has  a  bounded  cumulative  hazard,  leading  to  an 
improper  survival  function  [i.e.,  5(oo|X)  >  0],  whereas  a  non¬ 
cure  model,  such  as  the  Cox  model,  has  an  unbounded  cumu¬ 
lative  hazard,  thus  resulting  in  a  proper  survival  function  [i.e., 
5(00 1 X)  =  0].  Yakovlev  and  Tsodikov  (1996)  and  Chen  et  al. 
(1999)  provided  a  sound  biological  derivation  for  model  (1), 
and  Tsodikov,  Ibrahim,  and  Yakovlev  (2003)  provided  a  com¬ 
prehensive  review. 

In  reality,  it  is  often  the  case  that  the  covariate  X  can  be  mea¬ 
sured  only  approximately  or  indirectly,  leading  to  an  errors-in- 
variables  problem.  If  covariates  with  measurement  errors  are 
naively  taken  as  error-free,  then  severe  bias  can  be  induced  in 
the  parameter  estimates.  Fuller  (1987)  and  Carroll,  Ruppert, 
Stefanski,  and  Crainiceanu  (2006)  explored  various  methods 
for  correcting  the  bias.  The  observed  variable,  denoted  by  W, 
is  typically  related  to  the  true  covariate  X  through  a  model 
PW|x(W|X,  §),  where  §  can  be  an  unknown  parameter.  It  is 
common  to  assume  a  normal  additive  error  structure,  that  is, 
W  equals  X  plus  a  normal  random  noise.  When  this  normality 
assumption  does  not  hold,  one  needs  to  either  adapt  the  method¬ 
ology  to  treat  the  nonnormal  error  or  transform  the  covariates  X 
and  W  into  a  normal  error  form  (Nusser,  Carriquiry,  Dodd,  and 
Fuller  1996;  Eckert,  Carroll,  and  Wang  1997). 

The  Cox  model  with  measurement  errors,  has  been  studied 
extensively  in,  for  example,  the  induced  partial  likelihood  ap¬ 
proach  (Prentice  1982);  joint  models  of  survival  times  and  lon¬ 
gitudinal  co variates  measured  with  errors  (Tsiatis,  DeGruttola, 
and  Wulfsohn  1995;  Wulfsohn  and  Tsiatis  1997;  Tsiatis  and 
Davidian  2001),  the  regression  calibration  method  (Wang,  Hsu, 
Feng,  and  Prentice  1997),  pseudo-partial  likelihood  methods 
(Zucker  2005)  and  in  the  presence  of  a  validation  set  (Zhou 
and  Pepe  1995;  Zhou  and  Wang  2000;  Hu  and  Lin  2002). 
Hu,  Tsiatis,  and  Davidian  (1998)  and  Song,  Davidian,  and 
Tsiatis  (2002)  studied  semiparametric  likelihood-based  meth¬ 
ods  to  relax  the  distributional  assumption  on  the  covariates. 
Various  correction  estimators  and  corrected  scores  have  been 
provided  by  Stefanski  (1989),  Nakamura  (1990,  1992),  Kong 
and  Gu  (1999),  Buzas  (1998),  Huang  and  Wang  (2000),  Au¬ 
gustin  (2004),  Gorfine,  Hsu,  and  Prentice  (2004),  and  Song  and 
Huang  (2005).  Moreover,  measurement  error  problems  have 
been  addressed  in  other  contexts:  Kulich  and  Lin  (2000)  ex¬ 
plored  these  problems  in  the  additive  hazards  model;  Cheng 
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and  Wang  (2001),  linear  transformation  models;  Li  and  Ryan 
(2004),  with  heterogeneous  co variate  errors;  and  Li  and  Lin 
(2000),  Greene  and  Cai  (2004),  and  Hu  and  Lin  (2004),  in  ex¬ 
tensions  to  multivariate  failure  time  data. 

Although  there  is  a  rich  body  of  literature  dealing  with  mea¬ 
surement  errors  in  censored  survival  data,  all  the  aforemen¬ 
tioned  research  work  cannot  be  directly  applied  to  the  case 
with  a  cure  fraction.  To  the  best  of  our  knowledge,  measure¬ 
ment  error  issues  in  semiparametric  cure  rate  models  have  not 
been  addressed  to  date,  and  this  is  the  first  work  to  deal  with 
estimation  and  inference  in  this  regard.  We  can  appreciate  the 
difficulties  involved  in  such  models  due  to  various  unspecified 
components  and  their  interactions,  including  the  distributions 
of  the  unobservable  variables,  the  censoring  distribution,  and 
the  baseline  distribution  function.  In  fact,  we  have  found  that 
a  general  semiparametric  method  requires  one  to  either  assume 
covariate-independent  censoring  or  directly  model  the  censor¬ 
ing  mechanism,  neither  of  which  is  considered  a  satisfactory 
approach. 

This  research  is  motivated  by  a  recent  lung  cancer  study,  in 
which  the  objective  was  to  assess  the  association  of  patient  sur¬ 
vival  with  a  certain  biomarker  expression  in  the  tumor  cell  cy¬ 
toplasm.  For  each  patient,  we  had  either  one  reading  or  two 
readings  of  biomarker  expression  by  different  pathologists  to 
reduce  the  subjectivity  of  the  evaluation.  However,  neither  of 
the  two  measurements  of  biomarker  expression  could  be  con¬ 
sidered  precise.  Our  interest  lies  in  investigating  the  potential  of 
the  biomarker  as  a  new  prognostic  marker  and  therapeutic  tar¬ 
get  for  lung  cancer.  Figure  1  shows  the  Kaplan-Meier  survival 
curves  stratified  by  tumor  histology  (adenocarcinoma  or  squa¬ 
mous  cell  carcinoma).  After  approximately  7  years  of  follow¬ 
up,  we  can  see  a  stable  plateau  at  the  tails  of  the  survival  curves, 
which  indicates  the  existence  of  a  possible  cure  fraction. 

In  this  article  we  consider  the  proportional  hazards  cure  rate 
model  in  (1),  where  X  is  measured  with  errors.  The  error  struc¬ 


ture  is  not  necessarily  normal  additive,  and  multiple  measure¬ 
ments  may  exist  for  X.  Without  making  any  assumptions  on  the 
distribution  of  X,  we  propose  a  corrected  score  approach  based 
on  the  nonparametric  maximum  likelihood  estimator  (NPMLE) 
and  a  new  transformation  on  the  contaminated  covariates.  We 
show  that  the  estimators  for  /?  and  F(t )  are  strongly  consis¬ 
tent  and  converge  to  a  Gaussian  process  at  a  root -n  rate.  Due 
to  the  complex  natures  of  such  functional  measurement  error 
problem  and  NPMLE,  the  derivations  of  these  asymptotic  prop¬ 
erties  are  very  involved.  Furthermore,  the  proposed  nonpara¬ 
metric  transformation  on  the  covariates  to  improve  the  error 
normality  and  additivity  is  very  different  from  the  Box-Cox 
or  spline  transformation  described  by  Eckert  et  al.  (1997).  We 
show  that  our  transformation,  which  can  be  broadly  applied  in 
general  measurement  error  problems,  is  effective  and  easy  to 
use.  On  the  other  hand,  we  note  that  the  corrected  score  itself 
can  be  generalized  to  accommodate  nonnormal  error  structure. 
Due  to  the  proportional  hazard  structure,  the  NPMLE  and  par¬ 
tial  likelihood  estimator  are  equivalent,  and  thus  the  same  es¬ 
timator  can  be  derived  from  the  partial  likelihood  instead  of 
NPMLE.  A  more  detailed  discussion  of  this  issue  is  given  in 
Section  6. 

The  rest  of  the  article  is  organized  as  follows.  In  Section  2  we 
introduce  notation  and  propose  a  computationally  effective  es¬ 
timation  procedure  when  covariates  are  measured  with  normal 
additive  errors.  In  Section  3  we  derive  the  asymptotic  properties 
of  the  proposed  estimators  for  /?  and  F(t ),  and  in  Section  4  we 
propose  a  general  transformation  to  handle  the  nonnormal  and 
nonadditive  measurement  error  structure.  In  Section  5  we  report 
the  results  of  simulation  studies  that  we  conducted  to  evaluate 
the  finite- sample  properties  of  the  estimators,  along  with  our 
application  of  the  proposed  model  to  the  lung  cancer  data  set. 
We  give  some  concluding  remarks  in  Section  6  and  outline  the 
technical  details  of  the  proofs  of  the  theorems  in  the  Appen¬ 
dix. 


Figure  1.  Estimated  Kaplan-Meier  survival  curves  for  patients  with  lung  cancer,  stratified  by  tumor  histology  ( - squamous; 

adenocarcinoma) . 
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2.  METHODOLOGY 

For  i  =  1, . . . ,  n,  let  7)  be  the  failure  time,  let  Q  be  the  cen¬ 
soring  time  for  subject  i,  and,  correspondingly,  let  a  g -vector 
Xz-  denote  the  covariates,  the  first  component  of  which  is  1.  We 
assume  that  (7/,  A/,XZ-)  are  independent  and  identically  dis¬ 
tributed  (iid),  where  7/  =  min (7},  Q)  and  A/  =  /  (7/  <  Q)  is 
the  censoring  indicator.  Furthermore,  7}  is  conditionally  inde¬ 
pendent  of  Ct  given  covariate  X/ .  The  follow-up  time  is  infinite 
and  a  proportion  of  subjects  never  experience  failure  or  right- 
censoring,  that  is  Yi  =  oo  with  probability  one  for  those  sub¬ 
jects.  To  claim  a  subject  cured,  we  need  to  choose  a  threshold 
and  then  set  7/  =  oo  if  7/  is  larger  than  this  threshold.  In  prac¬ 
tice,  a  typical  threshold  is  the  largest  observed  event  time. 

For  simplicity  of  description,  we  assume  a  classical  measure¬ 
ment  error  model  structure,  where  the  error  is  additive  and  fol¬ 
lows  a  mean-0  normal  distribution.  The  treatment  of  the  non¬ 
normal  and/or  nonadditive  error  case  is  given  in  Section  4.  We 
formulate  the  cure  rate  model  with  covariate  measurement  er¬ 
rors  as 

S(r|X)  =  exp{-F(r)ex7^},  W  =  X  +  U, 

where  the  error  U  ~  N(0,  V).  The  observations  are  of  the  form 
{(Yi,  A/,  W/i, . . . ,  Wz>f),  i  =  1, that  is,  for  each  un¬ 
observable  X/,  we  have  r\  replicated  observations  of  W/y’s 
(j  =  1 , ,ri).  The  number  of  replicates  is  allowed  to  vary 
across  different  subjects,  and  it  also  may  occur  that  r*  =  1.  The 
case  where  some  covariates  are  error-free  is  accommodated  in 
our  model  by  setting  the  relevant  terms  in  V  to  be  0.  We  fur¬ 
ther  make  the  typical  surrogacy  assumption  that  W  and  7  are 
independent  conditional  on  X.  Thus,  given  the  unobserved  true 
covariate  X,  the  observed  covariate  W  does  not  contain  any  ad¬ 
ditional  information. 

Write  the  survival  function  and  the  probability  density  func¬ 
tion  for  the  event  of  interest  as  5g(7|X)  =  exp {—F(Y)exT &} 
and  fe(Y |X)  =  e\p{-F(Y)exT^}f(Y)exT^.  Similarly,  for  the 
censoring  times,  let  SC(F|X)  =  Pr(C  >  F|X)  and  fc(Y |X)  = 
— dSc(Y\X)/dY.  Under  the  cure  rate  model,  we  know  that 
Se(oo|X)  =  exp(- exr0)  >  0  and  Sc(oo|X)  >  0. 

If  X  is  observed,  then  the  likelihood  of  a  single  observation 
(7,  A)  given  X  can  be  written  as 

/ (7,  A|X) 

=  [{fe(Y\X)Sc(Y\X)}A{MY\X)Se(Y\X)}l-A],iY<cc) 
x  {Se(oo|X)Sc(oo|X)}/(F=oo). 

Similar  to  the  work  of  Zeng,  Yin,  and  Ibrahim  (2006),  we  con¬ 
struct  a  sieve  of  the  distribution  function  F,  and  thus  the  log- 
likelihood  is  given  by 

log/(F,  A|X)  =  A I(Y  <  oo){-F(Y)exTp  +  logF{F} 

+  XT  ft  +  logSc(F|X)} 

+  (1- A)/(F<oo) 


ease  of  exposition,  we  write  pi  =  F{7;},  denote  the  ordered 
distinct  failure  times  as  (7(i), . . . ,  7(m)),  and  denote  the  corre¬ 
sponding  jump  sizes  as  (p{\) , . . . ,  P(m))>  where  m  is  the  number 
of  distinct  failure  times.  Under  the  constraint  J2?=i  P(i )  =  1,  we 
introduce  a  Lagrange  multiplier,  A,  and  maximize 

n  /  m  \ 

J2  log  f(Y, ,  A,-  |Xj)  -  nX  I  p(l)  -  1 

i=\  \i= 1  / 

with  respect  to  (/?,  A,  /?( i) , . . . ,  p(m)).  Collecting  only  the  terms 
containing  the  unknown  parameters  (/?,  A,  F),  this  is  equivalent 
to  maximizing 

n 

J2[-F(Yi)exH}  +  A(- 1 (Yt  <  oo)(log  F{F,}  +  Xf  0)] 

1  =  1 


where  F(Yt)  =  Ey.<y.A.=1  F{Yj}  and  F(oo)  =  1. 

As  opposed  to  using  a  profile  likelihood  approach  (see  Zeng 
et  al.  2006),  we  take  a  backfitting  procedure  to  maximize  the 
log-likelihood.  To  be  more  specific,  we  solve  for  the  /?(/)’ s  and 
A  by  fixing  /?,  and  solve  for  p  by  fixing  the  /?(/)’ s  and  A.  The 
derivatives  of  (2)  with  respect  to  the  p(i)  s  and  A  are 

1  n  T 

- =  V /(Fm  <  Yj  <  oo )eXJfS  +nX,  i  = 

P(i) 

(3) 

and 

m 

Ew)  =  l.  (4) 

i= 1 

Therefore,  we  can  iterate  between  (3),  (4),  and 


n 

LlA'7 (Yi  <00 )-F{Yi )cx^|X,  =  0  (5) 

i= 1 

to  obtain  the  estimators. 

When  X  is  not  observable  but  the  W’s  are  observed  instead, 
we  modify  the  estimating  equations  so  that  they  are  functions  of 
the  observed  data  and  yield  consistent  estimators.  We  keep  (4) 
unchanged.  Following  the  corrected  score  approach,  we  modify 
the  m  equations  in  (3)  by  replacing  ex *  ^  with  ew*  v0/2? 


1 

Pd) 


±Y£i(rm<YJ*oo)'«T»>->T'",>+»x. 

7  =  1  7  k=  1 


i  =  1 , . . . ,  m .  (6) 


An  alternative  way  to  handle  multiple  measurements  is  to  take 
an  average  of  the  W^’s  for  each  i  a  priori  to  form  a  single 
“better”  observation,  that  is,  using 


x  {\ogfc(Y\X)-F(Y)exTp} 

+ 1  (F  =  oo){log5c(oo|X)  -  exTq, 

where  FT7}  denotes  the  jump  size  of  F(-)  at  7  and  F(-)  is  a 
right-continuous  function  with  jumps  at  event  times  only.  For 


—  =Y  I  (Y(i)  <  Yj  <  oc)eW^-PTy>P/2+nX 
AO  U 

to  replace  (3),  where  W/  =  r/_1  J2k=i  W. a  and  V/  =  rfW.  In 
practice,  we  have  found  that  the  two  treatments  of  the  W^’s 


746 


Journal  of  the  American  Statistical  Association,  June  2008 


yield  very  similar  results.  To  modify  (5),  we  replace  X/  with 
W«  and  exf^Xi  with  ew Jp-fiT\p/2(y/.  _  y/J)  t0  obtain 


-  F(Yi)ew‘^TY^2( Wik  -  Vp)}  =  0.  (7) 

The  final  estimators  under  the  corrected  scores  can  be  obtained 
by  solving  (4),  (6),  and  (7)  simultaneously. 


(C2)  Conditional  on  X,  the  censoring  time  C  is  independent 
of  7\andP(C  =  oc|X)>0. 

(C3)  The  true  parameter  /?0  belongs  to  the  interior  of  a 
known  compact  set  So,  and  the  true  cdf  Fq  is  differen¬ 
tiable  with  its  first  derivative  fo(t)  >0  for  all  t  e  7Z+ . 

These  are  rather  mild  conditions  that  are  routinely  made  in  cure 
rate  models.  We  now  present  the  asymptotic  properties  of  the 
estimators,  including  strong  consistency,  asymptotic  normality, 
and  the  variance  estimation  formula. 


3.  ASYMPTOTIC  PROPERTIES 

For  ease  of  exposition,  we  focus  on  the  situation  in  which 
we  have  one  surrogate  W  for  the  true  unobserved  X.  We  first 
introduce  some  notation  and  define  as  the  space  of  all 

bounded  linear  functionals  on  H,  where 

W  =  {(hi,A2):hie^,||hi||<l, 


Theorem  1.  Under  the  regularity  conditions,  assume  the  lim¬ 
iting  estimating  equation 

p{s*(0,  F)t hi  +  SF(fi,  F)[J  QF(h2)dF^  J  =  0 

has  a  unique  zero.  With  probability  1,  the  estimators  j$n  and 
Fn(t)  of  (4),  (6),  and  (7)  satisfy 


h2  is  a  function  in  [0,  oc)  with  \\h2W  v  <  1} 

and  \\h2Wv  is  the  total  variation  of  /*2-  Let  /(/?,  F)  be  the  log- 
likelihood  conditional  on  X, 

/(/?,  F)  =  -F(Y)exTp  +  A I(Y  <  oo){log/(7)  +  XT 0}, 

where  we  omit  the  part  of  /  (/? ,  F)  that  does  not  involve 
or  F.  Denote  the  derivative  of  /(/?,  F)  with  respect  to  /?  as 
\p(($,F)  and  write  If(P,F)[J  QF(h2)dF]  as  the  derivative 
of  /(/?,  F)  along  the  path  (/?,  F€  =  F  +  e  f  QFih2)  dF),  e  e 
(—60,  60)  for  a  small  constant  60  >  0,  where  QF(h 2)  =  h2(t )  — 
/o°°  ^2(0  dF(t).  This  operation  ensures  that  QF(h 2)  integrates 
to  0,  and  thus  the  perturbed  F  remains  a  valid  cdf.  This  re¬ 
striction  plays  the  same  role  as  the  Lagrange  multiplier.  The 
corrected  scores  can  be  constructed  by  replacing  the  terms  in¬ 
volving  X  in  1  F)  and  lF(P,  F)[f  Qp(h2)dF ]  with  those 
involving  W.  We  denote  the  expressions  after  such  replace¬ 
ment  as  Sp(fi,F)  and  Sf(P,  F)[f  QfQi2)  dF].  Note  that 
/  QFQi2)dF  does  not  involve  X  or  W.  Straightforward  cal¬ 
culation  yields  that 

SpQ 3,  F)  =  -F(F)ewr/J“^v/J/2(W  -  V/?)  +  A I(Y  <  oo)W 
and 

Sf(P,F)\J  QF(h2)dF 

=  -  f¥  QF(h2)dF(t)ewT^Ty^2 

Jo 

+  AI(Y  <  00)  Q F {h2(Y)} . 

Let  ¥n  and  P  denote  the  empirical  measure  of  n  iid  observa¬ 
tions  and  the  expectation;  that  is,  for  any  measurable  function 
g(Y,  A,  X)  in  L2(P), 

1  n 

P„ [g(Y,  A ,  X)]  =  -  V  g(Yi ,  A,- ,  X/)  and 
n  f— ' 

1= 1 

¥[g(Y,A,X)]  =  E[g(Y,A,X)]. 

We  assume  that  our  model  is  identifiable  and  that  the  following 
regularity  conditions  are  satisfied: 

(Cl)  The  covariate  W  is  bounded  with  probability  1. 


I  Pn  ~  Po  \  0  and  SUP  I  Fn(t)  ~  F0(t)  |  ->  0. 

teK+ 

Theorem  2.  Under  the  regularity  conditions,  ^/n(Pn  —  Pq, 
Fn  —  Fo)  converges  weakly  to  a  mean-0  Gaussian  process  in 


The  proofs  of  these  two  theorems  depend  heavily  on  the  em¬ 
pirical  process  theory  (van  der  Vaart  and  Wellner  2000),  which 
are  outlined  in  the  Appendix. 

Theorem  3.  Under  the  regularity  conditions,  the  estimator 
satisfies 

MK  -  fio)  N(0,  A-^CA-1)7) 


in  distribution  as  n  —>  00,  where 


A  = 


E 


^eWr/}-/}rV0/2 


F0(Y){\  -  (W  -  V/J)(W  -  \p)T] 


rY 

-(W-V0)/  b4(y)TdF0(y) 

Jo 

[  b4(y)dF0(y) 

0 

and  b4  is  given  in  (A. 5)  in  the  Appendix. 

The  derivation  of  the  variance  sandwich  formula  with  covari¬ 
ate  measurement  errors  is  very  different  from  that  without  mea¬ 
surement  errors,  as  shown  in  the  Appendix. 


K=\Sf$(Po,F0)  +  SF(Po,Fo) 


4.  NONNORMAL  AND  NONADDITIVE  ERROR 


Measurement  error  models  often  require  transforming  co¬ 
variates  toward  the  error  normality  and  additivity.  Considering 
one  component  of  W  and  X,  we  need  to  find  a  suitable  transfor¬ 
mation  function  0  so  that  0(W)  =  <p(X)  +  U,  where  U  follows 
a  mean-0  normal  distribution.  When  the  transformation  0  be¬ 
longs  to  a  parametric  family  indexed  by  y,  for  the  case  with 
duplicates  (r*  =  2),  we  have  that 

0(Wi,  y)  =  0(X,  y)  +  Ui,  0(W2,  y)  =  0(X,  y)  +  U2, 

where  U\  and  U2  are  independent  mean-0  normal  variables. 
We  can  estimate  the  parameter  y  through  the  maximum  like¬ 
lihood  approach  based  on  {0(Wii,  y)  —  0(W/2,  y)}/V 2,  for 
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i  =  1, . . . ,  n.  But  in  many  practical  situations,  a  standard  trans¬ 
formation  family,  such  as  the  Box-Cox  or  power  transforma¬ 
tion,  may  not  be  sufficient  to  achieve  the  desired  normal  and 
additive  error  structure.  Eckert  et  al.  (1997)  proposed  a  class 
of  transformations  based  on  piecewise  cubic  spline  functions. 
This  family  of  transformation  is  nonparametric  and  versatile 
and  often  performs  superior  to  the  Box-Cox  transformation. 
However,  its  implementation  can  be  difficult,  and  the  transfor¬ 
mation  contains  several  ad  hoc  procedures. 

To  enhance  the  flexibility  of  our  model,  we  propose  a  trans¬ 
formation  that  is  easy  to  use  and  completely  data-driven.  We 
first  sort  all  of  the  W*/s  in  an  increasing  order,  denoted  as 
(W(1),  W(2), . . . ,  W{2n)).  Let  bi  =  4>(Wa)  -  0(W/2),  b(i)'s  be 
the  order  statistics,  and  let  (q\,  <72,  •  •  • ,  Qn)  correspond  to  the 
(.5/n,  1.5/tz,  . . . ,  1  —  .5 /n)  quantiles  of  the  standard  normal 
distribution.  Then  we  search  for  the  set  of  4>(Wij)'s  that  min¬ 
imizes  [{^(/)  —  H'bMvb  ~  #/]2,  where  fit,  and  <t£  are 

the  sample  mean  and  sample  variance  of  the  bi' s.  The  mini¬ 
mization  is  performed  under  a  monotonic  constraint  that  the 
0(W//)’s  follow  exactly  the  same  order  as  the  W//s,  that  is, 
0(W( i))  <  <t>(W(2))  <  •  •  •  <  0(W(2«)).  The  basic  intuition  be¬ 
hind  this  operation  is  that  we  want  to  find  a  transformation  0 
so  that  the  resulting  n  sample  quantiles  are  the  closest  to  the 
expected  quantiles  in  terms  of  the  mean  squared  error  (MSE). 
Other  than  monotonicity,  we  impose  no  constraints  on  0;  thus 
this  transformation  is  more  flexible  than  those  proposed  in  the 
literature.  For  convenience,  we  use  the  MSE  as  the  evalua¬ 
tion  criterion  for  the  transformation;  one  certainly  could  opt  for 
other  criteria,  such  as  the  mean  absolute  deviation  or  a  weighted 
average,  which  may  emphasize  the  central  part  more  than  the 
tail  part  of  the  data  or  may  focus  only  on  the  maximum  dis¬ 
tance. 

The  proposed  transformation  is  rank-preserving  but  cannot 
make  a  distinction  between  0  and  ao  +  a\4>  for  any  con¬ 
stants  clq  and  a\.  Thus,  to  ensure  identifiability,  we  set  the 
first  two  values,  <p(W(\))  and  0(W(2))»  to  two  constants,  say 
0(lL(i))  =  1  and  <p(W(2))  =  2.  Our  original  problem  involves 
an  order-constrained  minimization,  which  often  requires  rather 
specialized  optimization  routines.  If  we  reparameterize  and 
take  0(W(7))  =  Y^j=i  eTj  ’  then  we  can  minimize  ~~ 

[ib\lGb  ~  Qi]2  without  constraints  to  obtain  the  tj's  and  hence 
the  0(W*7)’s.  Note  that  a  single  value  change  in  any  of  the  tj's 
would  cause  changes  in  i±b  and  at ;  thus  the  optimization  cannot 
be  simplified  by  investigating  each  individual  term  separately. 
In  addition,  the  fixed  order  of  0(W7)’s  does  not  imply  a  fixed 
order  of  bi' s;  thus  the  objective  function  may  not  be  differen¬ 
tiable  at  the  r j  values  at  which  a  change  in  the  order  of  bi 's 
occurs.  Because  of  these  considerations,  we  use  a  large-scale 
Nelder-Mead  simplex  method  as  the  optimization  procedure,  in 
combination  with  multiple  sets  of  dispersed  starting  values  for 
xj' s,  to  avoid  convergence  to  local  minima.  It  is  worth  point¬ 
ing  out  that  although  the  optimal  solution  gives  the  best  trans¬ 
formation  toward  normality,  in  reality,  we  would  be  content  as 
long  as  the  resulting  0(W*  1)  —  0  ( W/2)  was  sufficiently  close 
to  normality.  Various  procedures  can  be  used  to  examine  the 
performance  of  the  transformation.  We  formulate  the  Pearson- 
type  statistic  in  the  form  of  J2k=l  —  Ok)  /Ek,  where  K  is 
the  number  of  partitions  of  the  data  space,  and  Ek  and  Ok  are 
the  expected  and  observed  bin  counts.  Under  the  null  model  in 


which  (bi  —  f JLb)/<?b  follows  the  standard  normal  distribution, 
the  Pearson  statistic  asymptotically  follows  a  chi-squared  dis¬ 
tribution  with  degrees  of  freedom  K  —  1  (see,  e.g.,  Rao  1973). 
The  proposed  nonparametric  transformation  is  straightforward 
and  is  quite  effective,  as  we  show  in  our  numerical  studies. 
Once  we  obtain  the  0(W7)’s,  the  variance  V  can  be  easily  es¬ 
timated  using  the  sample  variance  of  {0(W/ 1)  —  0(W*2)}/v/2, 
i  =  1 , . . . ,  n . 


In  general,  there  is  no  guarantee  that  a  normal  additive  er¬ 
ror  can  always  be  achieved.  In  cases  where  the  normal  er¬ 
ror  cannot  be  obtained,  the  estimating  equations  (3)  and  (5) 


should  be  corrected  by  replacing  X; ,  e 
e^lEieW)  and  Xje^  with 
eVfjPE(VjeV^)/E(eVJ^. 


with  W j  -  EOJj), 
W je^/EieW)  - 


5.  NUMERICAL  STUDIES 


5.1  Simulation 

We  conducted  three  sets  of  simulation  studies  to  examine  the 
small- sample  performance  of  the  proposed  methods.  First,  we 
studied  a  cure  rate  model  function, 

S(t \XU  X2)  =  exp{—  exp(/30  +  £1*1  +  faX2)F(t) },  (8) 

where  X\  is  a  uniformly  distributed  random  variable  on  [0,  1] 
and  subject  to  measurement  errors  and  X2  is  a  Bernoulli  ran¬ 
dom  variable  that  takes  a  value  of  0  or  1  with  equal  probability. 
We  took  the  true  parameters  fa  =  .5,  fa  =  1,  2  =  —-5,  and 
F(t)  =  1  —  exp(— t).  The  measurement  error  model  was  formu¬ 
lated  as  W  =  X\  +  U,  where  U  was  a  normal  random  variable 
with  mean  0  and  standard  deviation  o .  We  considered  a  =  .  1 
and  .2  to  examine  the  impact  of  the  measurement  error  on  the 
estimators.  When  the  censoring  time  was  generated  from  an  ex¬ 
ponential  distribution  with  mean  1,  designated  as  exp(l),  the 
resulting  data  set  had  an  approximate  censoring  rate  of  17%, 
and  a  cure  rate  of  8%;  and  when  the  censoring  time  was  gener¬ 
ated  from  exp(.l),  it  yielded  a  censoring  rate  of  33%.  We  took 
sample  sizes  of  n  =  200  and  300,  and  performed  1,000  sim¬ 
ulations  under  each  configuration.  For  each  data  replicate,  we 
implemented  the  backfitting  procedure  to  estimate  /3o ,  fa,  and 
fa  and  the  corresponding  variances.  The  corrected  estimating 
equations  were  solved  using  the  Newton-Raphson  algorithm, 
which  converged  very  fast  and  was  quite  robust  to  the  initial 
values.  For  comparison,  we  also  carried  out  a  naive  estimation 
procedure,  in  which  the  measurement  error  was  ignored  and 
W  was  treated  as  X\.  The  simulation  results  are  presented  in 
Tables  1  and  2,  corresponding  to  censoring  rates  of  17%  and 
33%.  As  we  can  see,  even  with  a  small  measurement  error  scale, 
a  =  .  1 ,  the  naive  estimator  of  fa  was  biased  upward  and  that  of 
fa  was  biased  downward,  and  these  biased  increased  severely 
as  the  measurement  error  increased  to  a  =  .2.  The  correspond¬ 
ing  coverage  probabilities  of  95%  confidence  intervals  were  un¬ 
der  the  nominal  level,  especially  for  the  cases  with  a  =  .2.  In¬ 
terestingly,  because  covariate  X2  was  measured  precisely,  the 
estimator  of  fa  under  the  naive  method  performed  well;  the  bias 
was  negligible,  and  the  coverage  probability  was  close  to  95%. 
In  contrast,  the  proposed  estimator  successfully  corrected  the 
bias  under  all  of  the  scenarios.  Moreover,  the  estimated  vari¬ 
ances  based  on  the  asymptotic  normal  approximation  formula 
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Table  1.  Simulation  results  under  model  (8)  with  17%  censoring 


Proposed  method  Naive  method 


n 

Estimate 

ft 

pi 

ft 

ft 

ft 

ft 

200 

Bias 

-.011 

.031 

.002 

.041 

-.089 

.006 

Empirical  variance 

.039 

.095 

.031 

.035 

.072 

.030 

Estimated  variance 

.037 

.098 

.028 

.031 

.073 

.027 

95%  cv 

.941 

.956 

.936 

.923 

.927 

.929 

300 

Bias 

-.005 

.021 

-.001 

.046 

-.096 

.003 

Empirical  variance 

.028 

.066 

.019 

.025 

.050 

.018 

Estimated  variance 

.025 

.064 

.018 

.020 

.048 

.018 

95%  cv 

.932 

.939 

.956 

.901 

.926 

.950 

200 

Bias 

-.007 

.028 

-.003 

.152 

-.336 

.007 

Empirical  variance 

.054 

.164 

.032 

.033 

.058 

.030 

Estimated  variance 

.052 

.156 

.040 

.026 

.055 

.027 

95%  cv 

.941 

.946 

.944 

.805 

.687 

.936 

300 

Bias 

-.008 

.026 

-.004 

.148 

-.331 

.005 

Empirical  variance 

.034 

.103 

.020 

.022 

.038 

.018 

Estimated  variance 

.033 

.100 

.019 

.017 

.037 

.018 

95%  cv 

.943 

.952 

.951 

.764 

.576 

.950 

NOTE:  95%  CV  represents  the  coverage  probability  of  95%  confidence  intervals. 


were  quite  close  to  the  empirical  variances,  and  our  method  pro¬ 
duced  satisfactory  coverage  probabilities  at  the  95%  nominal 
level.  When  o  increased,  the  estimated  variances  for  the  /Ts 
increased  as  more  variability  was  incorporated  into  the  model 
and  the  estimation  procedure.  However,  the  opposite  was  true 
for  the  naive  estimators.  Because  W  was  treated  as  the  true  co¬ 
variate  X  \ ,  more  variation  in  the  covariate  would  produce  bet¬ 
ter  estimators;  the  variances  using  the  naive  method  were  in  fact 
smaller  for  a  =  .2  compared  with  those  with  a  =  .  1 .  At  a  higher 
censoring  rate,  as  shown  in  Table  2,  similar  conclusions  can  be 
drawn.  The  estimated  variances  increased  as  the  censoring  per¬ 
centage  increased  and  decreased  as  the  sample  size  grew  large. 

Our  second  simulation  was  designed  to  study  a  scenario  with 
replicates  for  mismeasured  co variates.  We  considered  a  cure 


rate  model, 

S(t\Xi,  X2,  X3,  X4)  =  exp{— exp(ft  +  ftXi  +  p2X2 

+  p3X3  +  p4X4  )F(r)},  (9) 

where  both  X\  and  X 2  are  Bernoulli  random  variables  that 
take  a  value  of  1  with  probabilities  of  .5  and  .6,  and  X3  and 
X4  are  generated  from  uniform  distributions  on  [—.5,  .5]  and 
[0, 1].  Here  X4  was  unobservable;  instead,  we  observed  two 
replicates  (W\ ,  W2),  where  each  replicate  was  X4  plus  a  nor¬ 
mal  error  with  mean  0  and  standard  deviation  o  =  .2.  The  true 
parameters  were  fio  =  .5,  /3\  =  .5,  fa  =  —  .5,  ^3  =  1,  @4  =  —  1, 
and  F(t)  =  1  —  exp(— t).  The  censoring  times  were  generated 
independently  from  exp(l),  yielding  an  approximate  censoring 


Table  2.  Simulation  results  under  model  (8)  with  33%  censoring 


(7 

n 

Estimate 

Proposed  method 

Naive  method 

ft 

ft 

P2 

ft 

ft 

@2 

.1 

200 

Bias 

-.006 

.016 

-.003 

.046 

-.104 

.000 

Empirical  variance 

.054 

.135 

.038 

.047 

.102 

.037 

Estimated  variance 

.050 

.126 

.036 

.041 

.093 

.034 

95%  cv 

.949 

.946 

.937 

.928 

.924 

.935 

300 

Bias 

-.005 

.006 

.003 

.046 

-.110 

.006 

Empirical  variance 

.035 

.085 

.025 

.031 

.065 

.024 

Estimated  variance 

.033 

.082 

.024 

.028 

.062 

.023 

95%  cv 

.939 

.950 

.945 

.917 

.921 

.944 

.2 

200 

Bias 

-.025 

.078 

-.008 

.141 

-.312 

.007 

Empirical  variance 

.063 

.207 

.042 

.038 

.071 

.038 

Estimated  variance 

.070 

.205 

.038 

.036 

.070 

.034 

95%  cv 

.958 

.951 

.938 

.863 

.776 

.934 

300 

Bias 

-.001 

.032 

-.009 

.155 

-.331 

.003 

Empirical  variance 

.048 

.149 

.026 

.030 

.055 

.024 

Estimated  variance 

.044 

.128 

.025 

.023 

.047 

.023 

95%  cv 

.943 

.932 

.946 

.790 

.646 

.950 

NOTE:  95%  CV  represents  the  coverage  probability  of  95%  confidence  intervals. 
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Table  3.  Simulation  results  under  model  (9)  with  25%  censoring 


Proposed  method 

Naive  method 

n 

Estimate 

Po 

Pi 

P2 

Pi 

Pa 

Po 

Pi 

Pi 

Pi 

Pa 

Average  W 

200  Bias 

.021 

.009 

-.013 

.028 

-.037 

-.091 

.005 

-.008 

.020 

.184 

Empirical  variance 

.078 

.044 

.042 

.134 

.182 

.060 

.042 

.040 

.130 

.108 

Estimated  variance 

.069 

.042 

.042 

.128 

.173 

.053 

.041 

.040 

.123 

.099 

95%  cv 

.931 

.952 

.953 

.946 

.948 

.920 

.950 

.953 

.946 

.896 

300 

Bias 

.012 

.002 

-.006 

.013 

-.023 

-.096 

-.002 

-.001 

.006 

.189 

Empirical  variance 

.051 

.030 

.028 

.082 

.115 

.039 

.030 

.027 

.081 

.069 

Estimated  variance 

.045 

.028 

.027 

.084 

.110 

.035 

.027 

.027 

.081 

.065 

95%  cv 

.926 

.948 

.950 

.960 

.953 

.912 

.947 

.952 

.958 

.869 

Replicates  (W\,  W2) 

200  Bias 

.023 

.009 

-.012 

.028 

-.040 

-.162 

.002 

-.005 

.014 

.325 

Empirical  variance 

.080 

.044 

.042 

.135 

.187 

.051 

.042 

.039 

.129 

.073 

Estimated  variance 

.069 

.042 

.042 

.128 

.182 

.046 

.041 

.040 

.122 

.068 

95%  cv 

.929 

.950 

.955 

.944 

.956 

.877 

.954 

.954 

.945 

.745 

300 

Bias 

.012 

.002 

-.006 

.013 

-.024 

-.166 

-.005 

.002 

.001 

.327 

Empirical  variance 

.052 

.030 

.028 

.082 

.117 

.034 

.029 

.027 

.080 

.047 

Estimated  variance 

.045 

.028 

.027 

.084 

.114 

.030 

.027 

.026 

.081 

.045 

95%  cv 

.926 

.946 

.951 

.959 

.952 

.835 

.945 

.953 

.956 

.654 

NOTE:  95%  CV  represents  the  coverage  probability  of  95%  confidence  intervals. 


rate  of  25%.  We  implemented  two  different  treatments  of  the 
replicates:  averaging  W\  and  W2  to  obtain  a  single  “better” 
measurement,  W  —  (W\  +  W2)/2,  and  incorporating  each  in¬ 
dividual  measurement  (W\ ,  W2)  in  the  estimation  as  in  (6)  and 
(7).  The  simulation  results,  given  in  Table  3,  show  that  the  esti¬ 
mates  using  W  or  (W\,  W2)  were  comparable.  In  particular,  the 
bias  could  be  corrected  satisfactorily  compared  with  the  naive 
method,  the  asymptotic  variance  provided  a  good  approxima¬ 
tion  of  the  empirical  variance,  and  the  95%  coverage  probabil¬ 
ity  closely  matched  the  nominal  level.  As  the  sample  size  in¬ 
creased,  the  bias  and  variance  decreased.  But  the  naive  method 
had  obvious  biases  in  the  estimates  of  /3o  and  fa,  whereas  the 
parameter  estimates  for  the  precisely  measured  covariates  X\, 
X2,  and  A3  were  satisfactory.  Furthermore,  it  is  interesting  that 
using  the  duplicates  (  W\ ,  W2)  led  to  much  worse  biases  and 
coverage  probabilities  than  those  resulting  from  using  the  av¬ 
erage  W  based  on  the  naive  method.  The  averaged  covariate 
values  could  offset  the  effect  of  measurement  errors  to  a  certain 
extent,  because  the  random  noise  would  diminish  by  averaging 
over  multiple  replicates. 

In  the  third  simulation,  we  conducted  a  sensitivity  analysis 
to  demonstrate  the  effectiveness  of  the  proposed  transforma¬ 
tion  and  the  robustness  of  our  model  to  the  misspecified  normal 
additive  error.  We  examined  model  (8)  with  measurement  er¬ 
ror  structure  W  =  exp(Ai  +  17),  where  we  generated  U  from  a 
normal,  a  Student  t  with  degrees  of  freedom  10  and  5,  and  a  uni¬ 
form  distribution.  We  set  the  mean  of  U  to  0  and  the  standard 
deviation  of  U  to  .2  for  all  of  the  scenarios.  We  implemented 
the  proposed  transformation  on  the  covariate,  even  though  for 
all  cases  but  the  first,  whether  or  not  a  normal  additive  error 
structure  could  be  obtained  is  not  clear.  Table  4  shows  that  the 
proposed  method  performed  well  by  imposing  our  transforma¬ 
tion  when  W  is  linked  to  X  through  a  nonnormal  and  nonaddi¬ 
tive  error.  We  could  capture  the  true  transformation  that  recov¬ 
ered  the  normal  additive  error  structure.  When  the  normality 


assumption  was  violated,  our  estimation  procedure  appeared  to 
be  quite  robust  and  still  produced  estimates  with  very  small  bi¬ 
ases.  As  the  degree  of  freedom  of  the  t  distribution  decreased  to 
5,  the  performance  deteriorated  slightly.  In  the  simulations  not 
reported  here,  we  also  explored  other  transformations,  such  as 
W  =  {X\  +  I/)3,  and  found  similar  results. 

In  the  foregoing  simulations,  the  censoring  distribution  has 
an  infinite  support.  Because  in  reality  the  censoring  time  is  al¬ 
ways  finite,  we  also  conducted  simulations  in  which  the  censor¬ 
ing  distribution  was  finitely  supported.  Here  a  subject  is  con¬ 
sidered  cured  if  the  subject  is  censored  and  the  corresponding 
censoring  time  is  larger  than  the  largest  observed  event  time. 
The  estimator  remains  consistent,  and  the  variance  estimation 
and  the  95%  coverage  probability  are  satisfactory  as  well. 

5.2  Lung  Cancer  Data 

As  an  illustration,  we  applied  the  cure  rate  model  with  mea¬ 
surement  errors  to  the  lung  cancer  data  set.  The  study  group 
comprised  280  patients.  The  covariates  of  interest  included  ei¬ 
ther  one  or  two  readings  of  biomarker  expression,  tumor  his¬ 
tology  (61%  adenocarcinoma  =  1;  39%  squamous  cell  carci¬ 
noma  =  0),  and  patient  age  (range,  34  to  90  years;  mean,  66 
years)  and  sex  (52%  female  =  1;  48%  male  =  0).  The  covari¬ 
ate  age  was  standardized  to  have  mean  0  and  variance  1.  The 
underlying  true  expression  of  the  biomarker  could  not  be  mea¬ 
sured  precisely.  For  half  of  the  patients,  only  one  reading  of 
biomarker  expression  was  available,  whereas  for  the  other  half, 
two  different  readings  were  recorded,  with  no  preference  given 
to  either  reading. 

For  the  140  patients  with  2  readings  of  biomarker  expres¬ 
sion,  we  took  the  difference  of  the  logarithm  of  the  2  readings 
and  found  that  the  original  observations  of  biomarker  expres¬ 
sion  did  not  satisfy  the  normal  additive  error  structure.  After 
carrying  out  our  transformation  on  the  readings  of  biomarker 
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Table  4.  Transformation  and  sensitivity  analysis  of  the  error  structure  under  model  (8) 


Error 

Estimate 

Proposed  method 

Naive  method 

Po 

Pi 

Pi 

Po 

P\ 

Normal 

Bias 

-.017 

.047 

-.003 

.147 

-.334 

.008 

Empirical  variance 

.055 

.163 

.032 

.034 

.057 

.029 

Estimated  variance 

.052 

.159 

.030 

.026 

.056 

.028 

95%  cv 

.952 

.951 

.938 

.808 

.695 

.943 

*10 

Bias 

-.015 

.044 

-.006 

.147 

-.334 

.009 

Empirical  variance 

.057 

.173 

.033 

.035 

.061 

.030 

Estimated  variance 

.051 

.159 

.030 

.026 

.056 

.028 

95%  cv 

.939 

.942 

.946 

.812 

.690 

.946 

*5 

Bias 

-.084 

.203 

-.011 

.120 

-.273 

.007 

Empirical  variance 

.066 

.220 

.033 

.036 

.067 

.030 

Estimated  variance 

.062 

.203 

.031 

.027 

.061 

.028 

95%  cv 

.948 

.939 

.946 

.847 

.780 

.947 

Uniform 

Bias 

-.017 

.047 

-.003 

.145 

-.329 

.007 

Empirical  variance 

.054 

.150 

.031 

.035 

.055 

.029 

Estimated  variance 

.052 

.158 

.030 

.026 

.057 

.028 

95%  cv 

.954 

.957 

.952 

.803 

.718 

.944 

Normal 

Bias 

-.006 

.017 

-.001 

.150 

-.341 

.009 

Empirical  variance 

.035 

.106 

.020 

.022 

.041 

.019 

Estimated  variance 

.033 

.100 

.019 

.017 

.037 

.018 

95%  cv 

.936 

.944 

.951 

.762 

.578 

.948 

*10 

Bias 

-.009 

.027 

-.002 

.147 

-.333 

.007 

Empirical  variance 

.033 

.103 

.022 

.021 

.039 

.020 

Estimated  variance 

.033 

.099 

.019 

.017 

.037 

.018 

95%  cv 

.950 

.941 

.934 

.756 

.586 

.937 

*5 

Bias 

-.075 

.180 

-.006 

.120 

-.272 

.006 

Empirical  variance 

.038 

.128 

.023 

.022 

.042 

.020 

Estimated  variance 

.039 

.124 

.020 

.018 

.040 

.018 

95%  cv 

.952 

.941 

.935 

.816 

.729 

.937 

Uniform 

Bias 

-.006 

.021 

-.003 

.147 

-.331 

.007 

Empirical  variance 

.032 

.099 

.020 

.021 

.038 

.019 

Estimated  variance 

.033 

.098 

.019 

.017 

.037 

.018 

95%  cv 

.957 

.945 

.941 

.768 

.567 

.951 

NOTE:  95%  CV  represents  the  coverage  probability  of  95%  confidence  intervals. 


expression,  we  can  see  that  the  error  structure  was  much  closer 
to  normal  based  on  the  quantile-quantile  plot  in  Figure  2.  We 
also  performed  the  Pearson  chi- squared  test,  under  which  we 
obtained  a  p  value  >  .7  for  K  ranging  from  4  to  10;  thus  a  nor¬ 
mal  error  structure  after  the  transformation  was  quite  convinc¬ 
ing.  For  patients  with  duplicated  readings  of  biomarker  expres¬ 
sion,  we  used  the  averaged  value  W  and  the  individual  observa¬ 
tions  (W\,  W2)  for  the  analysis.  Table  5  shows  that  ignoring  the 
measurement  error  could  cause  severe  bias,  particularly  in  the 
estimates  of  the  intercept  and  the  biomarker  expression  effect. 
We  found  that  biomarker  expression  significantly  affected  pa¬ 
tient  survival;  a  higher  expression  was  associated  with  a  shorter 
survival  time.  The  naive  method  tended  to  underestimate  the 
variance  and  produce  a  downward  bias  for  the  biomarker  ef¬ 
fect.  As  for  other  error- free  covariates,  the  proposed  method  and 
the  naive  method  yielded  similar  estimates.  Patients  with  a  tu¬ 
mor  histology  of  adenocarcinoma  had  a  significantly  better  sur¬ 
vival  rate  than  those  with  squamous  cell  carcinoma;  moreover, 
younger  patients  could  be  expected  to  live  longer  at  a  lower  risk 
of  death.  There  was  no  significant  difference  in  survival  across 


sex  in  this  study  population,  although  there  was  a  trend  that 
women  might  live  longer. 

The  cure  threshold  in  our  model  could  be  determined  through 
consultation  with  physicians,  which  is  a  medical  issue  based  on 
the  patient  population  and  disease  status.  Because  such  a  thresh¬ 
old  is  restricted  to  lie  to  the  right  of  the  largest  failure  time, 
we  conducted  a  sensitivity  analysis  by  taking  the  cure  thresh¬ 
old  at  7,  7.5,  or  8  years.  We  found  that  the  parameter  estimates 
were  not  sensitive  to  the  specification  of  the  threshold,  because 
it  only  affected  the  censored  observations  at  the  right  tail. 

6.  DISCUSSION 

We  have  proposed  a  semiparametric  cure  rate  model  with  co¬ 
variate  measurement  errors.  The  model  inherits  the  well-known 
proportional  hazards  structure,  with  the  corrected  score  func¬ 
tions  derived  based  on  NPMLE  to  estimate  p  and  F(t).  The  as¬ 
ymptotic  consistency  and  root -n  convergence  of  the  estimators 
were  established  through  modem  empirical  process  techniques. 
Simulation  studies  showed  that  the  corrected  score  approach 
produced  consistent  estimators,  whereas  the  naive  estimation 
typically  led  to  severe  biases. 


Ma  and  Yin:  Cure  Model  With  Mismeasured  Error 


751 


(a)  (b) 


Figure  2.  Quantile-quantile  plots  of  the  error  with  the  transformed  readings  of  biomarker  expression  (a)  and  the  original  observations  (b)  for 
the  lung  cancer  data. 


For  the  proportional  hazard  cure  model,  the  NPMLE  and 
partial  likelihood  estimator  were  in  fact  identical.  This  can  be 

verified  by  assigning  nX  in  NPMLE  to  J2yj=o o  e  •  The  de- 
tails  of  this  derivation  are  given  in  the  Appendix.  Naturally, 
the  corrected  score  estimator  thus  also  can  be  derived  equiv¬ 
alently  through  the  partial  likelihood  approach.  In  our  view,  us¬ 
ing  NPMLE  has  several  advantages.  First,  consistency  is  self- 
evident  through  a  conditional  distribution  argument.  Second, 
the  cure  rate  model  and  the  Cox  model  are  fundamentally  dif¬ 
ferent;  therefore,  to  estimate  the  intercept  b  using  the  partial 
likelihood  procedure,  we  need  to  reparameterize  the  model  by 
taking  b  =  log  Ao(oo).  Third,  the  partial  likelihood  approach 
is  restricted  only  to  the  proportional  hazards  structure,  whereas 
NPMLE  is  much  more  general.  For  example,  the  proportional 
odds  cure  model  can  be  easily  handled  by  NPMLE,  whereas 
the  partial  likelihood  is  not  applicable  (see  Zeng  et  al.  2006). 
Even  though  finding  a  corrected  score  is  not  always  straight¬ 
forward  or  even  possible,  a  general  Monte  Carlo-corrected 
score  method  can  be  implemented  in  practice.  In  this  regard, 


Table  5.  Regression  coefficient  estimates  and  estimated  variances  for 
the  lung  cancer  data,  using  the  averaged  and  individual  reading 
of  biomarker  expression 


Estimate 

Intercept 

Histology 

Age 

Sex 

Biomarker 

Averaged  reading  W 
Proposed  estimate 

-.2904 

-.5622 

.4352 

-.0687 

.0505 

Estimated  variance 

.2990 

.0639 

.0138 

.0450 

.0077 

Naive  estimate 

-.1672 

-.5306 

.4337 

-.0748 

.0299 

Estimated  variance 

.1347 

.0489 

.0130 

.0426 

.0026 

Individual  reading  (W\,  W2) 
Proposed  estimate  —.2648 

-.5555 

.4348 

-.0697 

.0462 

Estimated  variance 

.2885 

.0624 

.0138 

.0450 

.0074 

Naive  estimate 

-.1448 

-.5248 

.4334 

-.0758 

.0261 

Estimated  variance 

.0923 

.0476 

.0134 

.0434 

.0020 

W i  can  be  augmented  with  U iV—l  to  form  W; ,  where  U; 
has  the  same  distribution  as  the  measurement  error  U; ;  cal¬ 
culate  the  score  function  using  W \ ;  and  set  the  real  part  to  0 
to  solve  for  /?.  Although  the  Monte  Carlo-corrected  score  ap¬ 
proach  does  not  always  guarantee  a  consistent  estimator,  in  the 
case  when  a  true  corrected  score  does  exist,  it  will  be  consis¬ 
tent;  thus  it  can  be  viewed  as  a  numerical  way  of  finding  a 
corrected  score.  Further  exploration  in  these  areas  should  be 
worthwhile. 

The  corrected-score  method  belongs  to  the  family  of  func¬ 
tional  approaches  that  make  no  distributional  assumptions  on 
the  unobservable  true  covariates,  as  opposed  to  structural  mod¬ 
els  that  specify  a  distribution  of  X.  However,  the  corrected  score 
and  its  generalized  form  depend  on  the  additive  error  structure. 
Because  in  reality  not  all  measurements  can  be  transformed  to 
normality  and/or  additivity,  the  more  general  semiparamatric 
approach  proposed  by  Tsiatis  and  Ma  (2004)  is  worth  explor¬ 
ing.  Preliminary  studies  toward  this  end  have  uncovered  several 
modeling  and  computational  issues,  including  the  need  to  esti¬ 
mate  the  censoring  mechanism  or  strong  assumptions,  such  as 
censoring,  independent  of  the  covariates.  These  same  difficul¬ 
ties  also  prevent  us  from  absorbing  the  transformation  inside  of 
the  estimation  procedure  itself. 

An  alternative  approach  in  functional  measurement  error 
models  is  based  on  the  simulation-extrapolation  (SIMEX) 
method  (Cook  and  Stefanski  1994;  Stefanski  and  Cook  1995; 
Li  and  Lin  2003;  Greene  and  Cai  2004).  SIMEX  first  simulates 
data  sets  with  an  increasing  amount  of  measurement  errors  and 
then  extrapolates  back  to  the  nonerror  case.  Because  the  correct 
extrapolation  function  is  generally  unknown,  in  theory  SIMEX 
can  produce  only  approximately  consistent  estimates.  However, 
in  practice  its  performance  is  often  satisfactory,  and  sometimes 
it  even  outperforms  the  asymptotically  “correct”  methods.  It 
would  be  interesting  to  implement  SIMEX  under  the  cure  rate 
models  with  measurement  errors. 

In  contrast  to  the  usual  classical  measurement  error  structure, 
where  W  =  X  +  U,  U  is  independent  of  X,  in  another  class  of 
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errors,  called  Berkson  error,  X  =  W  +  U  and  U  is  independent 
of  W.  Berkson  error  models  are  often  relatively  easier  to  deal 
with,  because  the  distribution  of  the  unobservable  variable  X 
does  not  appear  in  the  likelihood.  In  the  Cox  model  framework, 
Zucker  (2005)  considered  such  an  error  structure  and  proposed 
a  consistent  estimator.  Similar  estimators  can  be  developed  for 
the  cure  rate  model.  Further  consideration  in  the  presence  of  a 
mixture  of  classical  and  Berkson  errors  may  be  also  of  interest. 

APPENDIX:  PROOFS 


Proof  of  Theorem  1 

Multiplying  (6)  by  /?(/)  =  Fw{7/ }  on  both  sides  and  summing  over 
the  m  equations,  we  have 

1  n  no o 

in  =  -YJ^iHYi<oo)-  /  Hn(y,Pn)dFn(y),  (A.l) 
Jo 

where 

Hn(y,pn)  =  -  J2  I(Yj>y)e'NyP"~P"VPJ1. 
n  L — ' 

Yj  coo 

Thus  Fn{Yi)  =  A i/n{Xn  +  77^(7/ , /?„)}.  Obviously,  from  (A.l),  kn 
should  be  bounded  by  a  constant  with  probability  1;  therefore,  by 
choosing  a  subsequence,  still  indexed  by  {n},  we  assume  that  kn  ->  7* . 
By  choosing  a  further  subsequence,  we  assume  that  Pn  ->  /?*  and 
Fn  ->  F*  pointwise. 

Note  that  the  classes  {W}  and  {7(7  >  y)}  are  P-Donsker,  because 
of  their  monotonicity  and  uniform  boundedness.  Under  the  continu¬ 
ously  differentiable  operation  of  taking  the  exponential  and  the  alge¬ 
braic  operation  of  multiplication,  the  class 

{l(oc>Y>y)e^Tp-pT^p/2:PeB0} 

also  is  P-Donsker  (see  van  der  Vaart  and  Wellner  2000,  thms.  2.7.5, 
2.10.6,  and  2.10.8)  and  thus  is  Glivenko-Cantelli.  Due  to  the  Gliven- 
ko-Cantelli  theorem  and  the  bounded  convergence  theorem,  we  con¬ 
clude  that  uniformly  in  y,  Hn(y,  Pn)  77*  (y),  where 

H*(y)  =  E{I( oo  >  Y  >  y)eW^*-rvr/2j. 

Moreover,  the  right  side  of  (A.l)  converges  to 

A*  =  E{AI(Y  <  oo)}  -  £  j/(T  <  oo). J  H*(y)dF*(y ) 

We  next  show  that  \k*  +  77*(y)|  is  bounded  away  from  0.  Because 
each  Fn{Yi }  is  nonnegative  and  Yff=  l  Fn{Yi}  =  1,  we  have 

l  =  Y  I(Yi<^)Ai  =y  7 (Yj  <  oo) A/ 

n(in  +  Hn(Yi9fin ))  "  n\kn  +  Hn(Yhfin)\ 

>  ly  7(7/<oo)A/ 

"  nh\\ln  +  Hn(Yi,fin)\+€’ 


where  cq  is  a  positive  constant.  This  implies  that  there  exists  a 
5*  >  0  such  that  \k*  +  T7*(y)|  >  <$*,  because  otherwise,  we  have  that 
infy  \k*  +  77*  (y)  |  =  0.  If  77*  (oo)  +7*  =  0,  then  77* (oo)  =  -7*  =  0. 
In  this  case,  \k*  +  T7*(y)|  =  77*  (y)  <  1  for  sufficiently  large  y,  which 
contradicts  (A.2).  If  7*  =  —  77* (yo)  for  a  finite  yo,  then  (A.2)  be¬ 
comes  1  >  cq  f£°  1  / 1 77*  (yo)  —  77*  (y)  |  dy.  This  is  impossible,  be¬ 
cause  77*  (y)  is  continuously  differentiable  in  a  neighborhood  of  yq. 
Furthermore,  \k*  +  77* (y)  |  >  <5*  implies  that  when  n  is  large,  \kn  + 
H„(y,  pn)\  >  S*.  Note  that  Fn(y)  =  n~l  £»=1  A iHXi  <  y)/\K  + 
Hn  ( Y( ,  jin)\  so  Fn  (y)  converges  uniformly  to  F*  (y)  =  E{  A7  (7  <  y)/ 
\k*  +  77*  (7)  | }. 

Based  on  the  definitions  of  S p(P,  F)  and  Sp(P,  F)[f  QF(h2)  dF], 
we  have 


Sp{Pn,Fny\YX+SF{Pn,Fn) 


;/ 


Qp  (h2)dFn 


=  0 


for  any  (hi,  h2)  e  77.  Using  the  uniform  convergence  of  (Pn,  Fn)  to 
(/?*,  F*),  as  n  ->  oo,  we  obtain 


Pjs^/?*,  F*)rhi  +  F*) 

On  the  other  hand,  we  also  have 

pjs0(/}o,£o)rhi  +  SF(/*o,fb) 


f  QF*(h2)dF* 

f  QF0(h2)dF0  J 


=  0. 


=  Pj  lp(P0,  F0y  ht  +  lF(P0,  F0) 
=  0. 


f  QFo(h2)dF0 


(A.3) 


because  of  our  construction.  Based  on  the  uniqueness  assumption  and 
continuity  of  Fq,  we  obtain  Pq  =  /?*  and  F*  =  Fq. 


Proof  of  Theorem  2 

Following  the  definition  of  77,  +Jn(Pn  —  Pq,  Fn  —  Fq)  can  be  treated 
as  a  linear  functional  in  the  metric  space  /°°(77),  which  is  defined  as 

~Jn(Pn  ~  00’  Fn  ~  £o)(hl’  h2) 

=  Vn(K  ~  0o)Thi  +  Vn  j  h2(t)d(Fn  -  F0). 

We  next  establish  the  asymptotic  distribution  of  +Jn(Pn  —  Pq,  Fn  —  Fq) 
in/00  (77). 

We  denote  the  derivative  of  S p(P,F)  with  respect  to  ft  as 
Spp(P,F),  the  derivative  of  S p(P,F)  with  respect  to  F  along 
the  path  F€  =  F  +  e(Fn  —  F)  as  SppiPi  F)[Fn  —  F],  the  deriva¬ 
tive  of  Sp (P,  F)[f  QF(h2)dF]  with  respect  to  P  as  SppiP,  F)  x 
if  QF(h2)dF],  and  the  derivative  of  SF(P,  F)[f  QF(h2)dF]  with 
respect  to  F  along  the  path  Fe  =  F  +  e(Fn  —  F)  as  Sff(P>  F)  x 
[ /  QF(h2)dF,  Fn  —  F].  Explicitly,  we  have 

Spp(P,F) 


for  any  positive  constant  e.  Because  Hn(y,  Pn)  converges  uniformly 
to  77*  (y),  we  have 


lA  7(7/  <  oo)A 

n 


--V- 

n 


7(7/  <00)A/ 


I  fn  +  Hn(Yi  ,Pn)\+€  nf^\k*  +  77*(7/)|  +  € 


0. 


Then,  after  taking  limits  on  both  sides,  we  obtain  1  >  F{A7(7  <  oo) / 
(| 7*  +  77*  (7)  |  +  e)}.  Let  6^0;  then  we  have 


1  > 


L 


c0dy 


o  r  +  ^WI 


(A.2) 


=  F(Y)ewTp~pTvp/2l\  -  (W  -  \P)(W  -  VP)T}, 
S FfiiP.F)^  QF(h2)dF 

=  -  fY  QF(h2)dF(t)ey,Tp~pTwp/2(W -XP), 

Jo 

SfiF(P,  F)[Fn  -  F] 

=  ~{Fn(Y)  -  F(Y)}e'NTp-pT'vp'2(W-  VP), 
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SFF(P,  F)  J  QF(h2)dF,Fn-F^ 

=  (Y  QFih2)diFn-F) 

Jo 

rp  rp  f*  OO 

+  (ew  P~P  yP!2F(Y)  -  I(Y  <  oo)A}  /  h2(t)d(Fn  -  F). 

Jo 

Our  estimating  equations  require  that 

P«  Fn)Th\  +  SF(pn,  Fn)  f  Qpn(h2)dFn J}=0. 

Noting  (A. 3),  we  obtain 

Vn(Pn  —  P){s^(^„,  Fn)Thi  +  SF(j}n,  Fn)  J  Qpn(h2)dFn 
=  — Fn)Th\  +  SF(Pn,  Fn)  f  QPn(h2)dF„ 

+  ■F’o)7'h1  +  Sf(Pq,  Fq)  f  QFo(h2)dF0  J. 


For  the  right  side,  simple  algebra  shows  that  uniformly  in  (hi, 
h2)eH, 

Fn)Thi+SF(pn,  Fn)  f  Qpn(h2)dF„ 

-SpiPo,  Fofh^  -  SF(P  o,  F0)  Q  Fo(h2)  d  Fq 

+  (K-Po)TSFp(Po,F0)y  QFo(h2)dF0 
+  h[S  pf[F„-F0] 

+  Sff(Pq,  Fq)  J  QFo(h2)dFQ,  Fn  —  Fq 
<  °p{\\Pn  ~  PoW  +  II Fn  ~  Fo\\l°°(H)}- 

Thus,  combining  this  with  the  expressions  of  Sy^Sy^S^y?,  and 
Spp,  we  obtain  that  the  right  side  of  (A.4)  equals 


(A.4) 

We  now  examine  the  left  and  right  sides  of  (A.4).  For  the  left  side, 
we  note  that  the  classes  {W}  and  {F(F)}  are  P-Donsker,  because  of 
their  monotonicity  and  uniform  boundedness.  Under  the  continuously 
differentiable  operation  of  taking  the  exponential,  and  the  algebraic 
operation  of  multiplication,  the  class 

|eWr/}-/JrV/}/2)  eWr0-0rV0/2(w  _  y/?) 

ewTP-PTyP/2(W-Vp)F(Y): 

Il0-0oll  <Sq,  swp\F(y)  -  F0(y)\  <S0! 

J  J 

also  is  P-Donsker  (see  van  der  Vaart  and  Wellner  2000,  thms. 
2.7.5,  2.10.6,  and  2.10.8).  In  addition,  the  class  {h2,QF(h2), 
fo  QF(h2)dF\ \\h2\\y  <  l,supy|F(y)  -  Fo(y)|  <  <5ol  contains 
functions  of  Y  with  bounded  variations,  and  so  also  is  P-Donsker. 
Therefore,  from  the  explicit  expressions  of  Sp  and  Sp,  the  preserva¬ 
tion  of  the  Donsker  classes  under  algebraic  operations  implies  that  the 
class 

A=  Jsj8(/S,F)rh1+5F(^,F)  J  QF(h2)dF  : 

IlhiH  <  1,  \\h2\\v  <  1,  ||0  -0o||+sup|F(y)  -  F0(y)|  <50] 

V  J 

is  P-Donsker.  On  the  other  hand,  based  on  the  consistency  of  $  and 
Fn,  the  bounded  norm  of  hi  and  the  bounded  total  variation  of  h2,  it 
is  straightforward  to  show  that 

Sp(pn,Fn)Thi+SF(Pn,Fn )  J  Qpn(h2)dFn^ 

Sp(fi0,  F0)rhi  +SF (0o,  F0)[/  Q F0(h2)  d Fq 
uniformly  in  (hi ,  h2)  e  TL.  Thus  the  left  side  of  (A.4)  is  equal  to 
^(P»-P){s^(0O:fo)rh1+SF(0o,Fo)[y’  QFo(h2)dF0 

+  op(  1), 

where  op(  1)  is  a  random  variable  that  converges  to  0  in  probability  in 
/°°(7Y).  As  a  result,  the  left  side  of  (A.4)  converges  weakly  to  a  mean-0 
Gaussian  process  in 


n\  (Pn~  Po)  (hl  ’  Qf0  ( h2 )) 


+  /  QF(^uQFo(h2))d(Fn-Fo)(y) 


+  op{^/n(\\Pn  —  Po\\  +  || Fn  -  Fq ll/oo (]-[))}, 


^/?(hU  QF0(h2)) 
=  E[Fo(Y)eWl 


Wr0-0rV0/2  ,v  _  /W  _ 


{V  -  (W  -  V0)(W  -  V0)7lh1] 


■  E  ev/TP~PT'vP/2(W  -  V0)  [Y  QFo(h2)dF0(t) 

J  0 


hi,  QF0(h 2)) 

=  E{e^TP-PTyP/2F0(Y)  -  I (Y  <  oo)Aj  QFo{h2(y)} 

-  E(l(y  <  Y)ewTP~PT^2[(W-\p)Thl  +  QFo{h2(y)}]) 

+  r  E(l(y<Y)eyfTP-PT^2 

Jo 

x  [fW  —  Y0)rh1  +  Qp0{h2(y)}])dF0(y). 

Note  that  the  last  term  in  the  foregoing  expression  is  added  to  en¬ 
sure  that  /  tip  (hi ,  Qfq  ( h2))dFo  =  0.  We  now  show  that  (Sip ,  SlP), 
which  is  a  linear  operator  on  a  subspace  of  H  to  itself,  with  mean-0 
S2p  under  Fq,  is  invertible.  Note  that 

fy?(hl’  Qf0(}i 2)) 


=  E  jsy^hi +SFy?  J  Qfq (h2)  j  j 

^dSF[fQF0(h2)dF0]l 

l  30  1  30  I 


dE (Sp\X,  Y,  A)  3 E(SF[f  Q Fo(h2) d Fq]\X,  Y,  A) 

- 30 - hl  + - 30 - 


=  E(lpphi+lFp  J  Qp0(h2)dFo  ^ 
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=  +lf  J  QFo(h2)dF0jl^ 

and,  similarly, 

p  oo 

J  nF(hi,QFo(h2))dG(y) 

=  E(hJSpF[G]  +  SFFy  QF0(h2)dF0,G^j 
=  E  [G]  +  SFFy  QFo(h2)dF0,  gJ|x,  7,  a) 

=  E(hTl\pF[G]  +  lFFy  QF0(h2)dF0,Gjj 
=  -E^hf  1  plF[G]  +  lF  y  QFQ(h2)dF0]lFlG] 


=  -E 


then  we  obtain 


for  an  arbitrary  function  G  such  that  dG(y)  =  0.  In  particular,  for 
G  =  f  Qfq  (h2)dFQ,  we  obtain 

poo 

/  £V(hi,  QF0(h2))QF0(h2)dF0 
J  0 

hfl plF  J  QF0(h2)dFo^  +  QFo(h2)dFo  ^ 


+  op{Fn(\\Pn  —  Po\\  +  ||  Fn  —  F0  ||,oo)}  +  op(  1) 

for  all  (h hh2)  e  n.  So,  -  p0)Thi  +  /0°°  QFo(h2)d(Fn  - 

Fq)}  =  Op(  1)  for  all  (h l,h2)  e  H,  and  thus  {Vn(||j?„  -  foil  +  II  At  - 
foil/  oo)}  =  0^,(1).  As  an  immediate  result, 

^{(fo -^o)rhi  +^°°  QFo(h2)d(Fn  -  F0) 

=  -M^n  -P){sfofo,  F0)rfl^1(h1,  QFo(h2)) 

+  sF(p0,F0)y  n-l(hhQFo(h2))dF0 

+  op(  1), 

and  thus  the  proof  of  the  theorem  is  complete. 

Proof  of  Theorem  3 

From  the  proof  of  Theorem  2,  if  we  take  QfqQi 2)  =  0,  then  we 
obtain  that 

\/»(fo  ~  Po)Thl 

=  -V^(P»-P){s^(^0,fo)7'^1(hl,0) 


If  for  certain  (hi,  h2)  and  an  arbitrary  G  satisfying  /q°° dG{y)  =  0, 
we  have 

(hi ,  Qf0  (h2))  =  0  and 

poo 

J  nF{hl,QF0(h2))dG(y)  =  0, 


+  sF<fto,F0) 


Ja? 


(hi ,  0 )dF0 


+  °p(  !)• 


Write 


poo 

0  =  h[flj(h1,GFo(&2))+  /  SiF(hi,QFo{h2))QFo{h2)dFo 

J  0 

=-hTlE(ipiTPhl+iFy  QFo(h2)dFo^y 

-E^hjlplp  J  Qp0(h2)dF0y  yy  Q  F0(h2)  d  Fq  j 
=  -F{(h[l,)2  +  2h[¥F[/  Qp0(h2)dF0 J 

+  (lF  f  QF0(h2)dFoy  J 

=  —E  j  +  Ip  J  QFo(h2)dF0  )  ; 

therefore,  with  probability  1,  h^I p  +  IfIJ  QfqQ1  2)dF$\  =  0.  When 
Y  =  00,  Ip  [/  Qf0  (^2)  <1Fq\  =  0,  and  thus  we  obtain  hi  =  0.  Subtract¬ 
ing  Ip  evaluated  at  (Y  <  00,  A  =  1)  by  its  value  at  (Y  <  00,  A  = 
0),  we  obtain  lF[f  Qp0(h2)  dF0](Y,  A  =  1)  -  lF[f  Qp0(h2)  dF0](Y, 
A  =  0)  =  Q F0{h2(Y)}  =  0  for  any  F  <  00,  and  thus  h2  =  0.  There¬ 
fore,  we  have  shown  that  (£2^,  £2p)  is  indeed  invertible.  Denoting  its 
inverse  (£2^  \  £2^),  we  can  rewrite  (A.4)  as 

V^j(fo  -Po)Thl  +fo 30  QF0(h2)d(Fn  -  Fq) 

=  -V^(Pn-P)Js^O?o,F0)rft^1(hi,eFo(/!2)) 

+  Sf(fo,F0)  f  n-l(huQFo(h2))dF0 


bi(y)  =  E{l(y<  Y)eyvT/}-PT'vl3/2(XV  -  Vj8)}, 

poo 

b2=/  bi(y)dF0(y). 

Jo 

ci  =  E{eV,Tp~pTyp/2F0(Y)  -  I(Y  <  oo)A}, 

c2(y)  =  E{l(y<Y)eV/Tf>-f>TVf>'2}, 


b3  = 


f 

.Jo 


{CI-C2O)}  ldFo(y ) 


n-1 


(A.5) 


X 


poo 

/  {bi(y)-b2}/{ci-c2(y)}r/Fo(y) 

Lio 


b4(v)  =  {ci  -  c2(7)}_1{b1(y)  -  b2  -  b3} 

uT  q_qT 


a1=F[F0(7)cW  v^2{V-(W-Vfo(W-Vfo7’}], 

a2  =  Ey,Tl}-F'vP/2( W- V/J)  JY  b4(y)T dF0(y) 


It  can  be  verified  that 


1(h1, 0)  =  (8l  -  a^-'hj  =  (A_1)rh1 


and 


fi-1(h1,0)  =  b4(y)r(A-1)7'h1, 


where  A  =  (ai  —  a2)T .  Thus 

V»(fo  -  0o)rhi 


=  ~Vn( Pn  -1 


Fq)  +  Sf(Pq,  Fq) 

+  °p(  !)• 


/ 


b4(y)i 


X  (A-‘)rhi 


Denote 


B  =  |  S/3(^0>  Fq)  +  SpiPo,  Fq) 


:/ 


b4(y)c/Fo 


02 
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where  a®2  =  aaT  for  a  vector  a.  Then  the  result  follows. 

Equivalence  of  the  NPMLE  and  Partial  Likelihood  Estimator 

The  partial  likelihood  estimator  for  the  proportional  hazards  cure 
rate  model  can  be  obtained  by  solving 


<oo)|Xf- 


i= 1 


and 


Y  A,  /(y,  <oo)  j,_Q 


=  0 


(A.6) 


(A.7) 


where  Rf  is  the  risk  set  at  Yj  and  X;  and  ft  represent  X  and  p  with  the 
first  component  (corresponding  to  the  intercept  b)  excluded  (see,  e.g., 
Tsodikov  1998b). 

In  the  NPMLE,  setting  the  Lagrange  multiplier  nX  = 
X.T  1 8 

J2yj=oo  e  j  H »  from  (3),  we  immediately  obtain 


Pi  = 


A iHYi  <  oo) 


and 


E 


■'jeRi 


e  J 


A kl (Y]c  <  °°) 


Yk<Yi  J 


and  thus 


-j^Rk 

eXlPXiAkI(Yk<co) 


i= 1 


i= 1  Yk<Yi 


-jeRk 


eXJP 


=  EA  kI<Xk«x>) 


k=  1 


and,  from  (5),  we  obtain 


EA,-/(^<oo) 

i=l 


(x, 


XTiP 


-jeRi 


e  J 


equivalent  to  (A.6).  From  (4),  we  have 


AA;/(7;<OQ)=i 
*=1  * 

equivalent  to  (A.7). 

[Received  September  2007.  Revised  February  2008.] 
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Bayesian  Semiparametric  Cure  Rate 
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ABSTRACT.  We  propose  a  Bayesian  semiparametric  model  for  survival  data  with  a  cure  fraction. 
We  explicitly  consider  a  finite  cure  time  in  the  model,  which  allows  us  to  separate  the  cured  and 
the  uncured  populations.  We  take  a  mixture  prior  of  a  Markov  gamma  process  and  a  point  mass  at 
zero  to  model  the  baseline  hazard  rate  function  of  the  entire  population.  We  focus  on  estimating  the 
cure  threshold  after  which  subjects  are  considered  cured.  We  can  incorporate  covariates  through  a 
structure  similar  to  the  proportional  hazards  model  and  allow  the  cure  threshold  also  to  depend  on 
the  covariates.  For  illustration,  we  undertake  simulation  studies  and  a  full  Bayesian  analysis  of  a 
bone  marrow  transplant  data  set. 

Key  words:  Bayesian  non-parametrics,  cure  model,  cure  threshold  model,  discrete  Markov 
gamma  process,  mixture  prior,  survival  analysis 


1.  Introduction 

In  the  study  of  time-to-event  data  for  certain  diseases,  there  exists  a  positive  probability  of 
not  observing  the  failure  event.  In  other  words,  there  exists  a  fraction  of  the  population  that 
is  cured  of  or  insusceptible  to  the  disease,  who  thus  will  never  experience  the  failure.  The 
survival  models  that  incorporate  a  cure  fraction  are  often  referred  to  as  cure  rate  models. 

Most  methods  for  modelling  survival  data  with  a  cure  fraction  have  been  rooted  in  two 
main  model  formulations:  the  standard  (mixture)  cure  rate  model  of  Berkson  &  Gage  (1952) 
and  the  alternative  (proportional  hazards)  cure  rate  model  of  Yakovlev  &  Tsodikov  (1996), 
which  was  later  studied  by  Chen  et  al.  (1999)  in  a  Bayesian  context.  The  standard  cure  rate 
model  (Berkson  &  Gage,  1952)  is  a  mixture  model  defined  by 

*Spop(0  —  7T  +  (1  —  7t)  S(t),  (1) 

where  n  e  (0, 1)  and  S(t )  is  a  proper  survival  function.  Note  that  lim^0  Spop(0  =  1  and 
lim^oo  £pop(/)  =  7i>0,  and  therefore  Spop(t)  is  an  improper  survival  function.  If  we  ignore 
the  impropriety  of  Spop(t),  and  carry  on  with  the  traditional  relationship  between  a  survival 
function  and  a  cumulative  hazard  function,  we  have 


Hpop(t)  =  -  log  Spop(t)  =  -  \og{n  +  (1  -  n)S(t)}, 


which  satisfies  that  lim^o  #pop(0  —  0  and  lim^oo  Hpop{t)  =  —  log  n  >  0.  If  we  proceed  and  take 
the  first  derivative,  then  the  hazard  rate  is 


^pop(0 


(l-7t)/(0 
71  +  (1  —  n)S(t)  ’ 


(2) 


where  f(t)  is  the  density  function  corresponding  to  S(t).  Now,  hpop(t)  has  the  following 
properties:  hpop(t)  =  (1  —  n)f  (0)  and  lim^oo/zpop(0  — 0.  Extensive  research  has  been 
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conducted  for  the  standard  cure  rate  model  in  the  literature  (Kuk  &  Chen,  1992;  Mailer  & 
Zhou,  1992,  1995;  Sy  &  Taylor,  2000;  Lu  &  Ying,  2004;  among  others). 

The  alternative  cure  rate  model  of  Yakolev  &  Tsodikov  (1996)  is  defined  by 

■SpopW  —  exp{— &F(/)},  (3) 

where  9>  0  and  F(t)  is  a  proper  cumulative  distribution  function.  This  model  satisfies  the 
conditions  that  lim^0  *Spop(0  =  1  and  lim,-^  Spop(t)  =  e-0;  therefore,  Spov(t)  is  not  a  proper 
survival  function.  Regardless  of  the  improper  condition,  the  cumulative  hazard  function  is 
given  by 

tfpop(0  =  “  loS  spop(t)  =  0F(t) 

which  leads  to  lim^0  FTpop(f)  =  0  and  lim,^  Hpop(t)  =  6>  0.  If  we  take  its  first  derivative,  then 
the  hazard  rate  is 

Vp  (0  =  ^#poP(0 = Qf '(*)>  (4) 

where  f(t)  is  the  density  function  corresponding  to  F(t).  We  have  that  lim^0^pop(0  —  0/(0) 
and  lim^oo/zpop(/)  =  0.  This  alternative  model  can  be  derived  within  a  biological  context 
related  to  metastasis-competent  tumour  cells,  and  thus  can  facilitate  a  nice  interpretation. 
More  recently,  Ibrahim  et  al.  (2001a)  and  Tsodikov  et  al.  (2003)  have  given  a  comprehensive 
review  and  discussion  of  the  current  development  of  the  cure  rate  model  (3)  from  both  the 
frequentist  and  Bayesian  perspectives.  Relevant  work  includes  that  of  Gray  &  Tsiatis  (1989), 
Peng  &  Dear  (2000),  Betensky  &  Schoenfeld  (2001),  Tsodikov  (2002),  Lam  &  Xue  (2005), 
and  Yin  &  Ibrahim  (2005),  among  others. 

In  the  semiparametric  cure  rate  models,  one  usually  concentrates  on  modelling  the  hazard 
rate  function  h(t)  of  the  uncured  group,  corresponding  to  S(t)  in  (1)  and  F(t)  in  (3),  respec¬ 
tively.  Ibrahim  et  al  (2001b)  proposed  a  piecewise  constant  hazard  function  to  model  the 
baseline  hazard  h(t).  More  specifically,  for  the  uncured  subjects, 

j 

h{t)  =  kjI{Sj-\  <t< Sj ), 

7  =  1 

where  0  <  s\  <  •  •  •  <  sj  with  sj>U ,  for  i=l,...,n.  The  number  J  controls  the  degree  to  which 
the  model  is  non-parametric  (its  ‘non-parametricity’  or  flexibility).  If  J  =  1,  h{t)  is  a  constant 
hazard  rate  and  the  larger  the  J  the  more  non-parametric  (flexible)  the  model.  In  addition, 
the  prior  is  adjusted  in  such  a  way  that  the  degree  of  parame tricity  in  the  right  tail  is 
controlled  by  a  smoothing  parameter.  However,  the  hazard  rate  h{t)  is  not  associated  with 
that  of  the  cured  group,  and  thus  the  implication  with  respect  to  the  hazard  rate  of  the  entire 
population  is  not  clear;  and  the  intervals  that  define  the  finite  partition  of  the  time  axis  are 
taken  to  be  data  driven. 

In  all  cure  models  known  in  the  literature,  the  researchers  model  a  positive  cure  proba¬ 
bility,  but  do  not  explicitly  quantify  the  finite  cure  time,  which  would  provide  very  useful 
information  with  direct  application  to  the  clinical  management  of  the  disease.  If  the  follow¬ 
up  of  a  patient  is  long  enough,  it  is  reasonable  to  consider  the  patient  cured  from  the  disease 
after  a  certain  time  point.  In  this  paper,  we  introduce  a  new  cure  rate  model  that  allows  us 
to  separate  the  cured  from  the  uncured  group  by  explicitly  modelling  a  finite  cure  threshold. 
For  this  purpose,  we  define  a  hazard  rate  with  a  cure  threshold  for  the  entire  population, 
including  both  cured  and  uncured  subjects.  Inference  of  the  model  is  driven  by  a  mixture 
non-parametric  prior  for  the  hazard  rate.  This  prior  is  established  from  a  mixed  gamma  pro¬ 
cess  with  the  marginal  distribution  at  each  partition  time  defined  by  a  mixture  of  a  gamma 
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distribution  and  a  point  mass  at  zero.  In  particular,  the  new  modelling  scheme  allows  us  to 
determine  the  subject-specific  threshold  after  which  an  individual  may  be  considered  cured. 

A  brief  description  of  the  paper  is  as  follows.  In  section  2,  we  define  the  new  cure  rate 
model  and  propose  the  non-parametric  prior  based  on  the  mixture  of  the  Markov  gamma 
process  and  a  point  mass  at  zero,  and  develop  its  properties.  In  section  3,  we  present  the  prior 
to  posterior  analysis,  and  derive  the  full  conditional  distributions  for  Gibbs  sampling.  In  sec¬ 
tion  4,  we  extend  the  new  method  to  cope  with  covariates  through  a  specification  similar  to 
the  proportional  hazards  model.  In  section  5,  we  illustrate  our  model  with  simulation  stud¬ 
ies  and  carry  out  a  full  Bayesian  analysis  of  a  real  data  set  from  a  bone  marrow  transplant 
study.  We  conclude  with  a  brief  discussion  in  section  6. 

2.  Cure  threshold  model 

In  order  to  formulate  a  meaningful  cure  rate  model,  we  examine  the  conditions  that  are 
required  for  the  hazard  function  of  the  entire  population  in  the  aforementioned  cure  rate 
models:  Hpop(t)  satisfies  (i)  lim,_>0#pop(0  =  0  and  (ii)  \\mt^00Hpop(t)  =  const.  <oo.  If  we  define 
ApopO)  =  dHpop(t)/dt ,  a  necessary  condition  for  (i)  and  (ii)  to  be  satisfied  is  that  lim^oo  hpop(t)  =  0. 

After  a  sufficient  follow-up,  the  uncured  subjects  would  have  experienced  the  event,  while 
the  cured  ones  would  be  either  censored  or  still  ‘alive’  in  the  study.  That  is,  after  the  cure 
threshold,  the  remaining  subjects  in  the  study  are  risk  free  of  the  event.  Following  this  route, 
we  propose  a  new  cure  rate  model  for  the  population  hazard  rate  function  that  vanishes  when 
t  exceeds  a  certain  threshold,  say  t,  that  is 

hvov  (t)  =  h(t)I(t<T),  (5) 

with  h(t)  a  non-negative  function.  This  new  specification  of  the  hazard  function  can  be 
interpreted  as  either  a  mixture  cure  model  (1)  or  a  proportional  hazards  cure  model  (3)  by 
constraining  the  event  time  of  the  uncured  group,  say  Tu ,  right-bounded  by  the  threshold  t, 
P(TU  <t)=  1.  In  practical  applications,  it  is  more  realistic  to  let  the  hazard  rate  drop  to  zero 
after  the  cure  threshold  as  the  subjects  who  have  survived  up  to  t  would  become  risk-free  of 
the  event.  The  cure  threshold  hazard  model  (5)  defines  a  valid  cure  rate  model  that  allows 
us  to  completely  identify  the  cured  and  the  uncured  groups. 

In  the  Bayesian  paradigm,  we  define  a  non-parametric  prior  for  h(t)  as  in  Walker  & 
Mallick  (1997), 

(X) 

h(t)  =  ^ XkI(xk - 1  <t< Tic), 

k=  l 

where  0  =  t0<ti<---  forms  a  partition  of  the  time  axis,  and  {Xk}  is  a  discrete  time  inde¬ 
pendent  gamma  process,  i.e.  Xkl~  Ga(a£,  pk),  where  Ga(a,  /?)  denotes  a  Gamma  distribution 
with  mean  a//f.  If  we  combine  this  prior  process  with  the  constraint  in  model  (5),  then  the 
cure  time  t  will  only  occur  in  discrete  periods,  which  can  be  accurately  estimated  with  a  fine 
partition.  Denoting  tz  as  the  discretized  cure  time,  then  condition  t  <  tz  can  be  replaced  by 
k<z,  and  thus  the  prior  on  the  entire  population  becomes 

oo 

hPop(t)  =  E  ^I{k  <  z)I(xk  _  i  <t<  Tk).  (6) 

k  =  1 

Moreover,  we  can  take  a  prior  on  tz  by  considering  a  prior  on  z.  If  we  denote  the  prior 
on  z  by  /(z),  then  the  new  process  {X*k}  with  Xk  =  XkI(k  <  z),  can  be  characterized  by 

/(4 1  z)  =  Ga (4 1  ak,  pk)I(k  <  z)  +  7(4  =  0)1  (k  >  z). 
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Marginalizing  over  z,  the  prior  distribution  of  X*k  becomes 

/(4) = %Ga(4 1  «*»  At) + 0  -  VkVOk = °). 

with  rjk  =  P(z>£),  i.e.  X*k  has  a  prior  distribution  given  by  a  mixture  of  a  Gamma  distribution 
and  a  point  mass  at  zero. 

The  first  and  second  moments  of  the  process  {h*k}  are  the  following: 

and 

Var(4)=f7jt(l-^)  (jCj  +t]k p-. 

Due  to  the  existence  of  z,  X*k  and  A*k+l  are  not  independent,  in  fact,  the  covariance  is  given  by 

Cov(4+1,4)  =  %+i(l -«/*)(^)  (£)•  (7) 

It  is  worth  noting  that  this  non-parametric  prior  differs  substantially  from  other  non/semi- 
parametric  constructions  for  cure  rate  models.  In  light  of  the  new  model  (5),  we  non- 
parametrically  model  the  hazard  rate  of  the  entire  population,  whereas  Ibrahim  et  al.  (2001b), 
for  instance,  model  the  hazard  rate  for  the  non-cured  subjects  only.  If  rjk  =  1,  i.e.  the  time 
interval  k  lies  before  the  cure  index  z,  E(2£)  =  a k/pk,  Var(2£)  =  akip2k  and  Cov(2£+1, 2£)  =  0, 
which  reduces  to  the  independent  piecewise  exponential  model. 

To  enhance  the  dependence  in  the  process  {hk}  and  to  further  model  trends  in  the 
hazard,  we  can  consider  the  Markov  gamma  process  of  Nieto-Barajas  &  Walker  (2002)  for 
the  kk’s  instead  of  taking  them  to  be  independent.  As  a  result,  we  induce  a  stronger  corre¬ 
lation  within  the  X*k’s.  The  Markov  gamma  process  {Xk}  is  defined  through  a  latent  process 
{uk}  in  the  following  way:  start  the  process  with 

2i~Ga(ai,ft) 

and  then  take 

w*|4~Po(c*4) 

and 

+ 1 1  Uk  ^  Ga (cLk  _|_  i  T  uk ,  fik  _)_  ]  T  cjd) 

for  k=  1,2, _  Here,  Po (c)  denotes  a  Poisson  distribution  with  intensity  c.  If  we  take  uk 

and  /4  to  be  constant,  then  the  process  {Xk}  is  stationary  with  marginals  2^~Ga(a,  /?)  and 
correlation  Corr(2^ + 1 ,  hk)  =  ckl(P  +  ck). 

Clearly,  if  z>k  for  all  k  with  probability  1  (w.p.l),  i.e.  there  is  no  (finite)  cure  time,  the 
prior  (6)  reduces  to  the  case  of  Nieto-Barajas  &  Walker  (2002).  If  ck  =  0  for  all  k ,  then  uk  =  0 
w.p.l,  which  implies  that  {k*k}  reduces  to  the  previous  case  in  (7)  where  the  dependence  is 
very  weak.  This  formulation  of  the  prior  represents  a  new  generalization  of  the  independent 
gamma  process  (Walker  &  Mallick,  1997).  It  enhances  the  flexibility  and  capability  of  the 
hazard  structure  to  model  survival  data  with  a  possible  cure  fraction. 

The  prior  modelling,  given  by  equation  (6),  implies  that  the  cure  fraction  n  defined  as  the 
proportion  of  the  population  that  will  never  experience  the  failure,  is  given  by 

71  =  lim  Spop(0  =  exp  h<Ck  -  t*  _  i)  >, 

l  k=  1  ) 
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since  Xk  is  taken  to  be  zero  for  k>z.  Note  that  the  sum  in  the  expression  for  n  is  a  finite 
summation  bounded  by  z.  This  is  due  to  the  fact  that  the  paths  of  our  non-parametric  prior 
characterize  that  the  hazard  rate  takes  the  value  of  zero  from  tz  onwards,  whereas  in  the 
standard  mixture  and  the  proportional  hazards  cure  rate  models,  the  hazard  rate  vanishes 
asymptotically  or  decays  toward  zero. 

One  needs  to  specify  a  prior  distribution  for  z,  which  has  a  support  on  {1,2,. . A  natu¬ 
ral  option  is  to  consider  a  positive  Poisson  prior  distribution,  i.e.  z  ~  Po+  (/i),  for  jjl  >  0,  where 
Po+  (c)  is  a  positive  Poisson  distribution  with  parameter  c,  that  is,  if  v  ~  Po+  (c)  then  v  —  1  ~ 
Po(c).  Although  the  sample  paths  of  our  prior  hpop(t)  are  piecewise  constant  almost  surely 
(a.s.),  the  sample  paths  of  the  corresponding  cumulative  hazard  function  Hpov(t )  (integrated 
process)  are  continuous  a.s.  Therefore,  the  non-parametric  prior  Spop(0  — e_//pop(^  assigns  a 
positive  probability  to  the  set  of  continuous  (improper)  survival  functions,  which  allows  for 
a  finite  cure  time. 

3.  Prior  to  posterior  analysis 

Let  /zpop(0  be  the  prior  hazard  rate  as  in  (6).  Then,  the  cumulative  hazard  function  Hpov(t) 
has  the  form  of 

(X) 

tfpop(fi)  ^  ^  XkI(k  ^  z)wki, 
k  =  i 

where 

{^k  ^k  —  In  '>  ^k 

ti-Tk-\,  ue( Zk-uZk\  (8) 

0,  otherwise. 

Through  fVoV(ti)  =  hvov(ti)Q~H^di\  the  density  function  /pop(0  can  be  easily  derived. 

Let  X  and  u  denote  the  vectors  of  Xk  and  uk,  A(_*)  and  u(_^  denote  the  rest  after  deleting 
the  kt h  component,  respectively.  The  joint  prior  distribution  of  (X,  u,  z)  can  be  written  as 

/a,u,z)=/a,u)/(z), 

with 

oo 

u)  =  Ga(Ai | ai , ft )  ft  {Po(«/t|cj:4)Ga(4+x|a/t+i  +uk,pk+l  +ck)}  (9) 

k  =  1 

and  /(z)  =  Po+  (z\n). 

Let  T  =  (Ti,. . .,  Tn)  denote  a  sample  of  size  n  from  /pop(0  with  possibly  right-censored 
observations.  Without  loss  of  generality,  let  us  assume  that  (Ti,...,  Tnu)  are  exact  failure 
observations  and  (Tnu+ 1,...,  Tn)  are  right-censored.  Thus,  the  likelihood  for  (X,u,z)  is 

nu  n 

lika,  u,z|t)  =  n  KopM  n  e-HrBr("\ 

i= 1  i= 1 

which  can  be  expressed  as 

oo 

lik(4u,z|t)=P[  {XkI(k<z)Yk  (10) 

k  =  1 

where 

nu  n 

rk  =  y^/fa-1  <f,-<Tt)  and  mk  =  ^2,wki,  (11) 

1=1  i= 1 

with  wki  given  in  (8). 
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Therefore,  the  derivation  of  the  posterior  full  conditional  distributions  is  straightforward. 
The  posterior  conditional  distribution  of  is 


/(4 1 4-*) ,  u,  z,  t)  =  Ga(4 1  ixk  +  uk.i+uk  +  rk,l)k+ck.i+ck  +  mkI(k<z))  (1 2) 


with  c0  =  0  and  w0  =  0  w.p.l.  The  latent  variables  u  do  not  appear  in  the  likelihood;  thus,  their 
posterior  conditional  distributions  are  the  same  as  the  prior  conditional  distributions,  i.e. 

f(uk | X,  U(-fc)) oc  y(\  -\-ui )r(a4  ~~\~Uk)  ck)^k^k+\\  kI{o,i,...}(uk).  (13) 

Finally,  the  posterior  conditional  distribution  of  z  is  given  by 


/ (z\k,  t) oc/ (z) exp 


y  mkI(k  <  z)/.k 
k  =  1 


I(k*<z). 


(14) 


Note  that  the  likelihood,  given  in  (10),  imposes  a  constraint  on  the  possible  values  of  z.  The 
likelihood  is  zero  for  all  values  of  z  such  that  there  exists  a  k>z  that  r^ 0.  Thus,  the  full 
conditional  distribution  is  different  than  zero  only  for  k*  <  z,  where  k*  is  the  largest  value  of 
k  such  that  the  interval  has  at  least  one  exact  observation. 

The  parameter  z  plays  a  critical  role  in  the  model.  In  particular,  z  determines  the  threshold 
tz,  such  that  if  a  patient  survives  up  to  that  time,  then  the  patient  can  be  considered  cured  of 
the  disease.  Remember  that  a  priori  z  ~  Po+  (p),  to  let  the  data  drive  the  posterior  inference 
for  p,  we  take  a  hierarchical  model  by  assigning  a  hyper-prior  distribution  for  p.  If  we  take 
p~Ga(ao,bo),  then  the  posterior  conditional  distribution  of  p  given  all  other  parameters  in 
the  model  only  depends  on  z  and  is  given  by 


f(p\z)  =  Ga (p  |  a0  +  z  —  1 , b0  +  1). 


Finally,  we  can  estimate  tz  with  tz,  where  z  is  the  posterior  median,  or  to  be  conservative, 
z  could  be  an  upper  quantile  of  the  posterior  distribution  of  z,  i.e.  z  is  the  smallest  value 
s  such  that  P(z  <  s)  =  1  —  a,  for  a  small  a.  Hence,  in  order  to  obtain  posterior  inferences,  a 
Gibbs  sampler  (see,  for  example,  Smith  &  Roberts,  1993)  with  these  full  conditional  distri¬ 
butions  needs  to  be  implemented. 


4.  Covariate  information 

In  the  standard  mixture  and  proportional  hazards  cure  rate  models,  co variates  can  be  incor¬ 
porated  by  modelling  n  or  6  as  a  function  of  the  covariates.  For  the  mixture  cure  rate  model, 
Kuk  &  Chen  (1992)  modelled  n  with  a  logistic  regression  model,  which,  according  to 
Chen  et  al.  (1999),  has  several  drawbacks:  it  does  not  have  a  proportional  hazards  structure 
for  the  entire  population  and  yields  improper  posterior  distributions  for  some  non- 
informative  improper  priors.  For  the  proportional  hazards  cure  model,  Chen  et  al  (1999) 
introduced  covariates  through  the  parameter  6 ,  in  order  to  have  a  proportional  hazards 
structure  for  the  entire  population. 

To  conduct  a  covariate  analysis,  we  propose  a  semiparametric  model  which  is,  in  spirit, 
similar  to  the  proportional  hazards  model  of  Cox  (1972).  The  ‘baseline’  hazard  is  modelled 
by  (6)  to  explicitly  impose  a  cure  fraction  and  a  cure  threshold.  In  addition,  we  equip  the 
model  with  more  flexibility  in  such  a  way  that  each  individual  can  have  a  different  cure  time 
indexed  by  zt.  Then,  for  individual  i  with  a  possibly  time-dependent  p-co variate  vector  x//, 
we  propose  the  hazard  function  to  be  of  the  form 
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hi(t\xi,zi)  =  h(t\zi)e^‘\  (15) 

where  y  is  a  vector  of  regression  parameters  and  h(t  \  zt)  is  the  ‘baseline’  hazard  function.  We 
assign  h{t  \  zt)  a  non-parametric  mixture  prior  that  is  capable  of  modelling  a  cure  fraction  and 
a  cure  time,  i.e. 

oo 

h(t  |  Zi)  =  22  <  Zi)I(zk  _  i  <t<  zk), 

k  =  1 

where  {^}  is  a  Markov  gamma  process  common  to  all  subjects  and  zf  is  the  cure  threshold 
index  for  subject  i.  Our  non-parametric  prior  on  h(t  \  zt)  allows  a  patient  to  be  considered  at 
risk  before  time  tz.,  and  to  be  considered  cured  if  the  patient  survives  beyond  time  tZ/. 

Model  (15)  is  not  a  proportional  hazards  model  due  to  different  cure  thresholds  indexed 
by  zt.  This  distinctive  feature  allows  us  to  estimate  the  subject- specific  threshold  by  assigning 
a  prior  distribution  / (zt)  dependent  on  the  co variates,  for  instance 

Po+  (es'yi), 

where  S  is  a  vector  of  unknown  coefficients  and  y,  is  a  ^-vector  of  fixed  covariates.  Note  that 
xz  and  yz  may  share  common  covariates. 

In  this  case,  the  cumulative  hazard  function  becomes 

oo 

Hi(t  |  x,-,  z;)  =  22  hl(k  <  Zi)wki(t,  x„  y), 
k  =  1 

where 

{J2_ ,  exp{y'x,(s)}  ds,  t  >  Tk 

f‘k _ ,  expj/x,^)} ds,  te(zk_u zk] 

0,  otherwise. 

Given  a  sample  of  size  n ,  let  nu  be  the  number  of  failures,  i.e.  (G,. . .,  TUu)  are  the  observed 
event  times,  and  (T„u  + 1  ,. . .,  Tn)  are  right-censored  observations.  The  likelihood  function  is 

which  can  be  re-expressed  as 

}oo  r  riu 

J-J  tfk e-mk(y,z)Xk  -  1  ^ 

k  =  1  L  i  —  1 

where 

nu  n 

rk  =  22  - 1  <ti<  zk)  and  mk(y,  z)  =  22  x«,  V)- 

i  =  1  /  =  1 

Then,  we  can  obtain  the  posterior  full  conditional  distributions,  which  are  needed  for  the 
implementation  of  Gibbs  sampling.  The  full  conditional  distributions  of  and  uj,  are  given 
in  a  form  similar  to  the  non-parametric  case  without  co  variates,  i.e.  the  conditional  distribu¬ 
tion  of  Xk  given  the  rest  is 

/(4 1 y,  k-k), i'  data)  =  Ga(4 \txk  +  uk-i+uk  +  rk,Pk  +  ck-i+ck  +  mk(y, z)) 
with  cq  =  0  and  uo  =  0  w.p.  1 ,  and 

f(uk  I  y,  A,  U(_jfc),  z,  data)  =f(uk  |  A,  u^)), 
given  by  equation  (13).  The  full  conditional  distribution  of  zt  is 
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/ (zj  1 5,  y,  k,  data)  oc  exp 


T:  hl(k  <  Zi)wki(ti ,  Xf,  y)  >  Po+  (zz  I  ed  yi)I(ki  <  zt). 


Note  that  the  likelihood  imposes  a  lower  bound  kt  for  zi9  where  kt  is  the  interval  where  the 
observation  tt  occurred.  If  tt  is  censored  then  there  is  no  lower  bound,  so  kt  =  1 .  Let  / (y)  be 
the  prior  distribution  for  y,  then 

(  nu  oo 

/( y  I 4  i  data)  oc f{y) exp  l  ^  y 'x, •(<,■)  -  ^  mk(y,  z)4 

W  =  1  fc  =  l 


Finally,  we  assign  a  hyper-prior  distribution,  / (<5),  for  the  vector  of  coefficients  S,  so  that 
the  data  can  help  us  to  determine  the  co variate  effects  on  the  cure  threshold.  Then,  the  full 
conditional  distribution  of  S  only  depends  on  the  cure  threshold  index  zi9  given  by 

f(d  I  z)  oc  f{8)  exp  |  (Vy ,(z,  -  1 )  -  e'5'5'1' ) 


As  in  the  case  without  co  variates,  the  cure  time  tZ/,  after  which  an  individual  would  be 
considered  cured,  can  be  estimated  by  rz.,  where  zt  is  an  upper  quantile  of  the  posterior 
distribution  of  zi9  which  can  be  determined  by  the  co  variates. 

Based  on  the  semiparametric  model  (15),  the  probability  of  not  experiencing  failure  for  an 
individual  with  covariates  xz(/)  and  cure  threshold  zi9  is  given  by  ni  =  \imt^00  Si(t\xi,  zt),  and 
more  precisely, 


7iz  =  exp  |  —  ^  kk  f  d^ 

t  k  =  \  ''h- 1 

This  gives  us  a  way  to  characterize  the  co  variate  effects  on  the  cure  fraction  n  /.  A  negative 
regression  coefficient  y,  for  example,  leads  to  a  larger  probability  of  not  experiencing  the  fail¬ 
ure  (i.e.  a  higher  cure  fraction)  when  increasing  the  value  of  the  corresponding  co  variate. 


5.  Numerical  study 

We  demonstrate  the  properties  of  our  cure  threshold  model  with  simulation  studies  in 
examples  1  and  2,  and  then  illustrate  its  application  to  a  bone  marrow  transplant  data  set  in 
example  3. 

Example  1.  First,  we  simulated  data  from  the  proportional  hazards  cure  rate  model  of 
Yakolev  &  Tsodikov  (1996)  by  taking  a  baseline  density  with  a  bounded  support.  We  took  a 
triangular  distribution  Tri (a,  c ,  b)  as  the  baseline  density,  which  puts  a  probability  of  one  to 
the  interval  [< a ,  b]  and  the  mode  at  c.  In  particular,  we  chose  a  =  0,  c  =  1  and  b  —  4.  The  censor¬ 
ing  time  was  independently  generated  from  a  uniform  distribution  to  yield  a  30%  censoring 
rate.  We  took  the  sample  size  n=  100  and  the  cure  proportion  e~0  =  O.2O. 

We  took  a  fixed  time  partition  with  t0  =  0  and  zk  =  zk  _  i  +  A,  with  A  =  0. 10,  for  k  =  1 , . . . ,  100. 
We  considered  different  values  of  (ak,pk,ck)  and  for  the  hyper-prior  distribution  on  p  we 
took  a0  =  bo  =  0.01  in  order  to  be  vague.  In  all  scenarios,  we  ran  the  Gibbs  sampler  for  10,000 
iterations  with  a  burn-in  period  of  1000. 

We  used  the  logarithm  of  the  pseudo-marginal  likelihood  (LPML)  statistic  as  a  model 
selection  criterion  (see,  for  example,  Sinha  &  Dey,  1997).  Table  1  summarizes  this  statistic 
as  well  as  several  other  posterior  estimates  for  different  sets  of  the  hyper-parameters  when 
fitting  our  semiparametric  model  (6)  to  the  simulated  sample. 
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Table  1.  Posterior  estimates  of  LPML,  cure  threshold,  poster¬ 
ior  mean  and  95%  credible  interval  (Cl)  for  n,  for  a  simulated 
sample  of  size  72=100  with  a  triangular  baseline  density 


(Xk 

At 

Ck 

LPML 

Tz 

ft 

95%  Cl 

0.01 

0.01 

0 

-153.11 

7.1 

0.188 

(0.110,  0.272) 

0.1 

0.1 

0 

-149.91 

4.2 

0.176 

(0.105,  0.259) 

1 

1 

0 

-134.42 

3.6 

0.133 

(0.082,  0.193) 

1 

1 

5 

-130.35 

3.8 

0.169 

(0.103,  0.246) 

1 

1 

20 

-127.33 

4.0 

0.173 

(0.098,  0.254) 

1 

1 

50 

-125.48 

4.1 

0.178 

(0.111,  0.259) 

Table  1  shows  that,  according  to  the  LPML  statistics,  the  best  fit  is  achieved  with  either 
one  of  the  last  two  sets  of  hyper-parameters  at  the  bottom  of  the  table,  which  correspond 
to  dk  =  fk  =  1  and  Ck  =  20  and  50.  With  our  mixture  prior  for  2*k,  a k  =  =  1  makes  the  con¬ 

tinuous  part  of  the  mixture  very  different  from  the  point  mass  at  zero,  and  thus  the  model 
can  easily  capture  the  change  point  z.  Otherwise,  if  and  fk  are  verY  small  to  induce  a  large 
prior  variance,  the  model  may  not  distinguish  well  between  the  continuous  part  and  the  point 
mass  at  zero.  The  posterior  estimate  of  the  cure  time  is  based  on  the  95%  quantile  of  the 
posterior  distribution  of  z,  i.e.  Tz  —  Axz  where  z  is  the  95%  quantile.  We  can  see  that  the 
two  best  fitting  models  according  to  the  LPML  statistic  can  accurately  estimate  the  true  cure 
time,  4,  and  the  true  cure  rate,  0.20.  For  a  comparison  with  other  (frequentist)  methods,  we 
computed  the  estimate  of  the  cure  proportion  based  on  the  work  of  Mailer  &  Zhou  (1992) 
and  obtained  71  =  0.182  with  a  95%  confidence  interval  (0.102,  0.263),  which  is  comparable 
to  our  estimates. 

In  Fig.  1,  we  present  the  hazard  rate  estimates  for  the  last  four  sets  of  parameters  in 
Table  1,  and  include  95%  credible  intervals  together  with  the  true  hazard.  The  credible 


Time  Time 


Fig.  1.  Posterior  hazard  rate  estimates  (solid  line)  together  with  95%  credibility  intervals  (dotted  line). 
The  dashed  line  corresponds  to  the  true  hazard.  Top-left  Ck  =  0,  top-right  Ck  =  5,  bottom-left  Ck  =  20 
and  bottom-right  Ck  =  50. 
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Fig.  2.  Posterior  distributions:  /t  (left  panel)  and  z  (right  panel). 


intervals  were  calculated  as  the  2.5%  and  97.5%  quantiles  of  the  posterior  distributions.  We  can 
see  that  the  hazard  estimates  are  smoother  as  c k  increases  and  the  credible  bands  become  nar¬ 
rower.  The  posterior  distributions  of  /ll  and  the  cure  threshold  z  are  reported  in  Fig.  2.  These 
graphs  correspond  to  the  last  set  of  hyper-parameters  in  Table  1,  that  is  (a*-,  pk,  ck)  =  (1, 1,  50). 
The  posterior  median  of  /i  is  32.8,  which  is  located  near  the  lower  bound  of  the  distribution 
of  z,  33.  This  suggests  that  the  50%  of  the  mass  in  the  lower  part  of  the  distribution  of  /i 
collapses  into  one  single  point  (the  lower  bound)  in  the  distribution  of  z  due  to  the  constraint 
at  the  last  exact  observation. 

To  examine  the  frequentist  properties  of  our  estimates,  we  calculated  the  coverage  per¬ 
centage  of  the  95%  credible  intervals  for  tz  and  n  based  on  500  samples  of  size  100.  If  we 
take  (a*;,  fik,  ck)  =  (1, 1,  50),  then  the  coverage  rate  of  the  95%  credible  interval  is  91.7%  for  tz, 
and  94.8%  for  n.  Therefore,  our  Bayesian  credible  intervals  preserve  the  desirable  frequentist 
properties. 


Example  2.  In  the  second  example,  we  intend  to  compare  the  proposed  model  with  the 
traditional  cure  rate  models  that  cannot  model  the  finite  cure  time.  We  simulated  data  from 
the  proportional  hazards  cure  model  of  Yakolev  &  Tsodikov  (1996)  with  the  baseline  den¬ 
sity  given  by  an  exponential  distribution  with  mean  1.  In  contrast  to  example  1,  the 
hazard  gradually  decays  to  zero  instead  of  dropping  to  zero  at  a  finite  cure  time.  We  applied 
the  proposed  cure  threshold  model  so  as  to  capture  a  finite  cure  time  in  this  Traditional’ 
case.  The  censoring  time  was  independently  generated  from  a  uniform  distribution  to  yield 
25%  of  the  censoring  times,  and  we  took  a  sample  size  of  72=100  and  a  cure  proportion 
e-0  =  0.25. 
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Table  2.  Posterior  estimates  of  LPML,  cure  threshold,  posterior 
mean  and  95%  Cl  for  n,  for  a  simulated  sample  of  size  n=  100 
with  an  exponential  baseline  density 


a* 

h 

ck  LPML 

it 

95%  Cl 

0.01 

0.01 

0  -120.73 

10.2 

0.249 

(0.166,  0.340) 

0.1 

0.1 

0  -120.13 

5.9 

0.229 

(0.151,  0.317) 

1 

1 

0  -117.40 

5.1 

0.124 

(0.075,  0.182) 

1 

1 

5  -111.09 

5.4 

0.193 

(0.127,  0.269) 

1 

1 

20  -108.77 

5.9 

0.219 

(0.143,  0.302) 

1 

1 

50  -107.47 

6.3 

0.232 

(0.152,  0.321) 

We  partitioned  the 

time 

axis  by  setting 

T0  =  0 

and  Tfc  =  Tfc  _  i  +  A,  with  A  =  0.1 5,  for 

k  =  1,. . .,  100.  We  took 

Cl{)  = 

%  =  0.01  for  the 

prior 

on  /a 

and  considered  different  sets  of 

hyper-parameters  for  the  Markov  gamma  process  of  the  baseline  hazard.  We  ran  the  Gibbs 
sampler  for  10,000  iterations  with  a  burn-in  period  of  1000.  Table  2  summarizes  the  model 
selection  statistic  LPML  and  posterior  estimates  of  tz  and  n.  According  to  the  LPML 
statistic,  the  best  fitting  models  are  obtained  when  otk  =  fk  =  l  and  Ck  =  20  or  50. 

For  the  last  three  cases  at  the  bottom  of  Table  2,  the  95%  credible  intervals  for  n  contain 
the  true  value  of  the  cure  rate,  0.25.  If  we  compute  the  estimate  of  the  cure  proportion  by 
Mailer  &  Zhou  (1992),  we  obtain  a  value  of  0.249  with  a  95%  confidence  interval  (0.160, 
0.337).  Although  our  cure  threshold  model  does  not  exactly  match  the  model  that  generated 
the  data,  due  to  the  difference  in  the  existence  of  a  cure  threshold  or  not,  our  model  managed 
to  correctly  estimate  the  cure  rate. 

Furthermore,  the  hazard  and  the  survival  functions  are  also  well  estimated.  In  Fig.  3,  we 
can  see  that  the  true  hazard  and  survival  functions  lie  between  the  95%  credible  bands  when 
taking  (otk,  pk,ck)  =  (  1,1,50). 


Fig.  3.  Posterior  estimates  (solid  line)  and  95%  Cl  (dotted  line):  Hazard  rate  (left  panel)  and  survival 
function  (right  panel).  The  dashed  line  corresponds  to  the  true  hazard/survival  function. 
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Considering  the  last  parameter  specification  in  Table  2,  and  noting  that  the  data  were 
simulated  from  the  model  with  no  cure  threshold,  then  an  estimated  cure  time  of  6.3  corre¬ 
sponds  to  the  99.94%  quantile  of  the  uncured  group.  This  would  mean  that  there  is  a  proba¬ 
bility  of  0.0006  of  observing  an  event  after  this  estimated  ‘cure  time’.  Therefore,  if  a  patient 
has  survived  up  to  t$,  the  patient  is  practically  cured.  In  practice,  a  cure  threshold  would 
provide  very  useful  information  and  guidance  for  the  clinical  management  of  the  disease.  In 
other  words,  our  model  is  able  to  adequately  estimate  the  cure  proportion,  as  well  as  provid¬ 
ing  additional  information  about  the  possible  cure  time. 


Example  3.  As  a  final  illustration,  we  applied  the  proposed  method  to  a  bone  marrow 
transplantation  (BMT)  data  set  (Avalos  et  al .,  1993).  BMT  is  a  procedure  to  restore  stem 
cells  by  replacing  bone  marrow  that  has  been  destroyed  as  a  result  of  treatment  with  high 
doses  of  anticancer  drugs  or  radiation  therapy.  The  goal  is  for  the  healthy,  transplanted  stem 
cells  to  restore  the  bone  marrow’s  ability  to  produce  functioning  blood  cells  in  numbers  that 
are  adequate  for  good  health. 

Patients  represented  in  the  data  set  had  been  diagnosed  with  Hodgkin’s  disease  or  with 
non-Hodgkin’s  lymphoma,  and  had  undergone  one  of  two  types  of  BMT:  (i)  allogeneic  BMT 
(infusion  of  bone  marrow  from  a  matched  sibling  donor),  or  (ii)  autogeneic  BMT  (rein- 
fusion  of  the  patient’s  own  marrow  that  had  been  removed  prior  to  marrow-destroying  treat¬ 
ment).  Of  43  BMT  patients,  16  had  undergone  an  allogeneic  transplant  (x\  =  1)  and  27  an 
autogeneic  transplant  (x\  =0).  The  objective  of  the  study  was  to  evaluate  the  leukaemia-free 
survival  difference  between  the  two  BMT  groups.  Time  to  death  or  relapse  was  recorded  in 
days.  Patients  were  also  categorized  by  their  diagnosis  of  Hodgkin’s  disease  (x2  =  1)  or  non- 
Hodgkin’s  lymphoma  (x2  =  0).  Other  covariates  included  in  the  analysis  were  the  Karnofsky 
score  at  transplant  (2C3)  and  the  waiting  time  in  months  from  diagnosis  to  transplant  (V4).  The 
Karnofsy  score  provides  a  general  indication  of  a  patient’s  wellness  or  level  of  disability  due 
to  disease  or  illness. 

In  Fig.  4,  we  show  the  estimated  Kaplan-Meier  survival  curves  stratified  by  the  transplant 
groups.  For  the  BMT  patients,  there  is  a  clear  stable  plateau  after  approximately  one  and 
a  half  years  of  follow-up.  This  is  strong  empirical  evidence  that  some  percentage  of  patients 
in  this  population  might  be  cured. 


Fig.  4.  Kaplan-Meier  survival  curves  stratified  by  transplant  groups:  ( — )  allogeneic  group  and  (•  •  •) 
autogeneic  group. 
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We  implemented  our  semiparametric  model  described  in  section  4  with  the  four  covariates 
(x\ , . . . ,  X4)  in  the  multiplicative  part  of  the  hazard,  and  the  same  four  co variates  to  determine 
the  cure  threshold  for  each  patient,  i.e.  yj  =  Xj  for  j=  1,. .  .,4.  We  defined  a  fixed  time  parti¬ 
tion  of  the  form,  t0  =  0  and  Tk  =  xk  _  1  +  A,  with  A  =  30  (days)  for  k  =  1 , . . . ,  72.  We  considered 
different  sets  of  hyper-parameters  (otk,pk,ck)  and  assigned  vague  independent  priors  to  the 
regression  parameters  ys,  that  is,  ys*~N( 0, 100)  for  s  =  1,. .  .,4,  and  vague  independent  priors 
for  the  coefficients  Sj,  i.e.  Sj  N( 0, 100)  for  j=  1, . . . ,  4. 

Simulating  from  the  posterior  full  conditional  distributions  of  X,  u  and  z  is  straightforward. 
However,  in  order  to  simulate  ys  and  dj,  we  implemented  Metropolis-Hastings  steps  (see,  for 
example,  Tierney,  1994),  described  as  follows: 

(i)  M-H  step  for  ys :  At  iteration  r  +  1 ,  we  simulated  y*  from  N(y(sr\  a2),  and  took  yir+1)  =  y* 
with  probability  p(y*,  y(/})  or  yir+1)  =  yir)  with  probability  1  —  p(y*s,  yP),  where 


fifs) 


exp 


5^(y:  -  y  irV«(0 + E  Wh(r)> z)  -  mk(y*,  z)}4 


(ii) 


with 

M-H  step  for  df.  At  iteration  r+ 1,  we  simulated  <5*  from  N{d(j\a1),  and  took 
Sp  =  8*  with  probability  q(5*,8jr))  or  <rj  + 1 '  =  5^  with  probability  1  —  q(8*,  Sy1).  where 


?0;,^r))  =  min|l,^fexp  £(S* 

with  d*  =  (5{p 1), . . . ,  <5f_t 1),  <5 ydfl 


l)-(e'*'y' 


We  carried  out  a  random  walk  algorithm  with  <r2  =  25  for  the  variance  of  the  proposal 
distribution  in  each  case.  We  ran  the  Gibbs  sampler  for  100,000  iterations  using  a  burn-in 
of  20,000.  The  posterior  estimates  of  the  LPML  statistic  are  summarized  in  Table  3,  from 
which  we  can  see  that  there  is  little  difference  in  the  model  fit  for  different  specifications  of 
the  hyper-parameters.  In  fact,  for  the  last  two  cases  where  (ock,  pk)  =  (2, 2)  and  ck  =  20,  50,  the 
fitting  of  the  model  is  basically  the  same.  At  the  bottom  of  Table  3,  we  have  included  the 
LPML  statistic  for  the  semiparametric  model  proposed  by  Chen  et  al.  (1999)  when  using 
a  partition  of  J  =  5  intervals  in  such  a  way  that  the  number  of  failures  in  each  partition  is 
approximately  the  same.  We  took  the  prior  distributions  Xj  ~  Ga(0.01, 0.01)  for  the  baseline 
hazard,  and  independent  N( 0, 100)  priors  for  the  regression  coefficients  in  their  proportional 
hazards  structure.  Clearly,  based  on  the  LPML  statistics,  the  model  by  Chen  et  al  (1999) 
does  not  perform  as  well  as  our  model. 

The  co  variates  appear  in  our  semiparametric  model  (15)  via  two  different  ways:  (i)  in  a 
multiplicative  manner  affecting  the  ‘baseline’  hazard  and  (ii)  in  the  Poisson  mean  of  zt  affect¬ 
ing  the  cure  threshold.  Posterior  estimates  of  the  covariate  effects  on  the  ‘baseline’  hazard,  y, 


Table  3.  Posterior  estimates  of  the 
LPML  statistics  for  the  BMT  data  set 


a* 

Pk 

Ck 

LPML 

1 

1 

0 

-163.33 

2 

2 

0 

-162.85 

2 

2 

5 

-162.61 

2 

2 

20 

-161.74 

2 

2 

50 

-161.52 

Model  of  Chen  et  al 

-165.50 
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Table  4.  Estimated  covariate  effects  on  the  hazard  for  the  BMT  data 


Covariate 

Our  model 

Model  of  Chen  et  al. 

Post,  mean 

95%  Cl 

Post,  mean 

95%  Cl 

Intercept 

4.59 

(2.79,  6.42) 

Transplant  type 

0.03 

(  0.92.0.86) 

0.27 

(-0.62,1.14) 

Hodgkin’s  disease 

1.17 

(0.14,  2.15) 

1.02 

(0.0003,  2.05) 

Karnofsky  score 

-0.07 

(-0.08,  -0.05) 

-0.06 

(-0.08,  -0.04) 

Waiting  time 

-0.01 

(-0.03,0.008) 

-0.01 

(-0.03,0.006) 

Table  5.  Estimated  covariate  effects  on  the  cure 
threshold  for  the  BMT  data 


Covariate 

Post,  mean 

95%  Cl 

Transplant  type 

0.10 

(-0.73,0.88) 

Hodgkin’s  disease 

-0.74 

(-1.70,  -0.001) 

Karnofsky  score 

0.03 

(0.02,  0.04) 

Waiting  time 

0.004 

(-0.01,0.02) 

are  presented  in  Table  4,  when  taking  (otk,  fik,Ck)  =  (2, 2,  50).  We  see  that  the  type  of  trans¬ 
plant  (autogeneic  or  allogeneic)  and  the  waiting  time  had  no  significant  effects  on  the  hazard, 
while  a  diagnosis  of  Hodgkin’s  disease  and  the  Karnofsky  score  did.  The  risk  of  death  or  dis¬ 
ease  relapse  was  approximately  twice  higher  for  patients  with  Hodgkin’s  disease  than  those 
with  non-Hodgkin’s  lymphoma.  The  Karnofsky  score  imposed  a  significant  negative  effect 
on  the  hazard,  i.e.  a  decrease  of  10  units  in  the  Karnofsky  scale  would  increase  by  50%  the 
risk  of  death  or  disease  relapse. 

In  Table  4,  we  have  also  included  the  estimated  co  variate  effects  when  fitting  the  model  of  Chen 
et  al.  (1999).  Comparing  the  results  of  the  two  models,  we  can  see  that  the  posterior  estimates 
for  the  Karnofsky  score  and  waiting  time  are  the  same,  while  those  for  the  transplant  type  and 
Hodgkin’s  disease  are  slightly  different.  Using  the  model  by  Chen  et  al  (1999),  we  can  obtain 
the  same  conclusion  regarding  the  covariate  effects  in  the  hazard  as  that  with  our  model.  A  diag¬ 
nosis  of  the  Hodgkin’s  disease  and  Karnofsky  score  had  significant  effects  on  the  hazard.  This 
supports  the  fact  that  even  though  the  models  are  different,  they  provide  similar  results. 

One  major  advantage  of  our  model  is  that  we  can  estimate  the  cure  threshold.  We  present 
the  co  variate  effects  in  the  Poisson  mean  to  model  the  cure  times  in  Table  5.  We  can  see  that 
the  covariates  with  a  significant  effect  on  the  cure  threshold  are,  a  diagnosis  of  Hodgkin’s 
disease,  and  the  Karnofsky  score.  A  patient  with  Hodgkin’s  disease  would  have  a  reduction 
of  50%  in  the  mean  cure  time  compared  to  a  patient  with  non-Hodgkin’s  lymphoma.  It  might 
not  seem  sensible  that  a  patient  with  a  more  serious  condition  would  experience  a  cure  sooner. 
However,  because  the  presence  of  Hodgkin’s  disease  increases  the  hazard  twice  as  much,  this 
implies  that  patients  diagnosed  with  Hodgkin’s  disease  are  likely  to  die  more  quickly  than 
patients  diagnosed  with  non-Hodgkin’s  lymphoma.  Therefore,  once  a  patient  with  Hodgkin’s 
disease  survives  beyond  a  relatively  shorter  threshold  of  time,  then  he/she  would  be  con¬ 
sidered  cured.  Similarly,  10  points  more  in  the  Karnofsky  scale  would  increase  35%  the 
mean  cure  time. 

Once  we  have  estimated  the  parameters  of  the  model,  we  can  obtain  predictive  estimates 
of  other  quantities  of  interest,  such  as  the  hazard  and  survival  functions,  cure  thresholds  and 
cure  rates.  Toward  this  goal,  we  considered  four  hypothetical  patients  with  differing  Karnofsky 
scores  and  differing  diagnoses  of  Hodgkin’s  disease  or  non-Hodgkin’s  lymphoma,  which  are 
the  two  co  variates  with  significant  effects  in  the  model.  Table  6  shows  the  co  variates  for  the 
four  new  patients  together  with  estimates  for  the  cure  times  (based  on  95%  quantiles)  and  cure 
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Table  6.  Predictive  cure  thresholds  and  cure  pro¬ 
portions  for  new  patients  in  the  BMT  data 


Patient 

(xi,x2,x3,x4) 

% 

95%  quantile 

71 

Post,  mean 

(0, 0, 90,  36) 

34  months 

0.64 

(0,1,90,  36) 

24  months 

0.50 

(0, 0, 60,  36) 

15  months 

0.27 

(0,1,60,  36) 

10  months 

0.15 

probabilities  when  setting  (aLk,  fk,Ck)  =  (2,2,  50).  As  expected,  patients  with  Hodgkin’s  disease 
have  shorter  cure  times  (15  and  10  months)  than  patients  with  non-Hodgkin’s  lymphoma 
(34  and  24  months),  for  Karnofsky  scores  of  90  and  60,  respectively  On  the  other  hand,  patients 
with  a  larger  Karnofsky  score  have  a  larger  probability  of  being  cured  from  the  disease. 

Finally,  in  Figs  5  and  6,  we  present  the  predictive  hazard  and  survival  function  estimates 
for  the  four  patients,  respectively.  The  hazard  rate  for  patients  with  Hodgkin’s  disease  and  a 
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Fig.  5.  Predictive  hazard  rate  estimates  for  four  patients  with  covariates  (x\,  x2,  x3,  xf):  solid  line 
(0,0,90,36),  dotted  line  (0,1,90,36),  dashed-dotted  line  (0,0,60,36)  and  dashed  line  (0,1,60,36). 


Fig.  6.  Predictive  survival  function  estimates  for  four  patients  with  covariates  (xi,  x2,  x3,  x4):  solid  line 
(0, 0, 90,  36),  dotted  line  (0, 1, 90,  36),  dashed-dotted  line  (0, 0, 60,  36)  and  dashed  line  (0, 1,  60,  36). 
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low  Karnofsky  score  is  higher  than  that  for  patients  with  non-Hodgkin’s  lymphoma 
and  a  high  Karnofsky  score.  In  the  predictive  survival  curve  estimates,  we  can  see  that  the 
Karnofsky  score  strongly  affects  the  probability  of  being  cured.  When  the  Karnofsky  score 
increases  from  60  to  90,  the  cure  rate  increases  from  15%  to  50%,  for  patients  with  Hodgkin’s 
disease,  and  from  27%  to  64%,  for  patients  with  non-Hodgkin’s  lymphoma. 


6.  Conclusions 

We  have  proposed  a  new  cure  rate  model  that  explicitly  incorporates  a  parameter  that  deter¬ 
mines  the  time  after  which  a  patient  can  be  considered  cured.  This  is  in  contrast  to  other 
cure  rate  models  with  which  the  estimation  of  a  cure  time  is  not  possible.  In  reality,  after  the 
cure  threshold,  the  hazard  drops  to  zero,  as  the  susceptible  or  non-cured  subjects  would  have 
already  experienced  the  event,  and  the  rest  of  the  subjects  in  the  population  would  be  con¬ 
sidered  risk  free.  This  is  a  distinct  and  attractive  feature  of  our  model  compared  with  other 
cure  rate  models  in  the  literature,  which  allows  the  hazard  to  slowly  decay  to  zero  regardless 
of  the  cure  threshold. 

For  making  inference,  we  have  proposed  a  semiparametric  prior  distribution  based  on  a 
mixture  of  a  Markov  gamma  process  and  a  point  mass  at  zero.  This  allows  us  to  model  the 
cured  and  uncured  groups  at  the  same  time,  whereas  the  two  groups  are  modelled  separately 
as  a  mixture  in  other  cure  models.  We  have  extended  our  proposal  to  cope  with  covariates  in 
a  Bayesian  semiparametric  cure  threshold  model.  Our  semiparametric  construction  includes 
an  exponential  link  function  of  the  covariates  in  a  multiplicative  way,  as  well  as  allows  each 
patient  to  have  a  different  cure  time  depending  on  the  covariates.  Implementation  of  our 
model  is  straightforward  via  a  Gibbs  sampler. 

In  a  frequentist  approach,  estimation  of  the  threshold  parameter  t  in  (5)  might  be  challeng¬ 
ing  when  leaving  the  non-negative  function  h(t)  completely  unknown.  The  partial  likelihood 
method  by  taking  h(t)  as  a  nuisance  parameter  does  not  appear  to  be  promising,  especially 
for  model  (15).  Alternative  approaches  for  studying  the  frequentist  properties  of  (5)  would 
be  required. 
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Abstract:  Advances  in  molecular  biology  demonstrate  that  cancer  is  heterogeneous  disease  necessitating  a 
personalized  management  approach.  This  is  introducing  a  paradigm  shift  in  clinical  trial  designs  where  molecular 
characterization  of  cancers  is  assuming  importance  equal  to  (or  even  more  than)  the  traditional  histologic 
diagnosis  as  the  eligibility  criterion  for  randomized  clinical  trials  of  new  therapies.  Recommendations  have  been 
made  to  gather  the  molecular  information  from  clinical  phase  II  trials  distinguishing  responding  from  non 
responding  tumors  for  subsequent  planning  of  large  scale  phase  III  trials.  However  by  the  time  we  reach  phase  II 
level,  more  than  a  billion  dollars  apart  from  years  of  research  have  been  invested.  It  would  be  therefore  prudent 
to  conceptualize  laboratory  based  platforms  to  obtain  the  proof  of  concept  as  early  as  possible,  even  before 
embarking  upon  the  pivotal  clinical  trials.  In  this  regard,  we  hereby  propose  and  detail  a  novel  preclinical  platform 
incorporating  the  existing  mouse  models  to  address  the  issue  of  tumor  heterogeneity  in  a  systematic  manner 
through  creation  of  a  setting  similar  to  phase  II  trials  in  human  patients.  By  providing  critical  information  about  a 
drug’s  efficacy  and  the  molecular  determinants  of  response  early  on,  this  platform  would  potentially  provide  a 
solid  foundation  to  build  avant-garde  clinical  trials  integrating  recent  advances  in  molecular  medicine. 


Keywords:  Cancer,  heterotransplant,  animal  model,  therapeutics,  drug  development 


Introduction 

The  approval  rates  for  the  Investigational  New 
Drugs  (INDs)  have  been  decreasing  whereas 
the  developmental  cost  and  duration  are 
increasing  exponentially  [1,2].  It  is  estimated 
that  approximately  90%  of  the  INDs  fail  to  get 
US  FDA  approval  and  this  situation  is  even 
worse  in  the  oncology  field,  with  over  95% 
failure  rate  3.  What  is  more  worrisome  is  more 
than  70%  of  the  anti-cancer  drug  candidates 
fail  in  Phase  II  and  approximately  60%  of  those 
entered  into  Phase  III  fail,  causing  enormous 
loss  of  time  and  resources  [3,4].  The  major 
cause  of  Phase  II  /  III  failures  is  lack  of  efficacy 
of  the  testing  agents  4  with  80%  of  phase  III 
failures  in  oncology  attributed  to  lack  of 
efficacy  [5]. 


While  lack  of  efficacy  has  been  traditionally 
attributed  to  ineffective  drugs,  in  the  post 
genomic  era  it  has  been  realized  that  many 
potentially  effective  agents  fail  because  we  fail 
to  address  the  molecular  subtypes  and 
signatures  accounting  for  variations  in 
response  to  treatment  and  survival  among 
patients  [6,7].  This  results  in  erroneous  clinical 
trial  design/  analysis,  enormous  wastage  of 
patient  and  monetary  resources,  addition  of 
another  failed  therapy  to  the  graveyard,  and 
demise  of  potentially  effective  therapies  as 
exemplified  by  the  failure  of  Gefitinib  to 
improve  outcome  in  non-small-cell  lung 
cancer  when  added  to  chemotherapy.  On  the 
other  hand,  brilliant  successes  of  Trastuzumab 
and  Imatinib  suggest  that  integration  of 
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molecular  determinants  of  response  in  clinical 
trials  is  a  promising  strategy  to  test  the 
intrinsic  efficacy  of  an  IND  against  a  specific 
patient  subgroup.  Recommendations  have 
therefore  been  made  for  enrichment  trial 
design  by  limiting  enrollment  to  patients 
(over)expressing  a  molecular  target  or  putative 
predictors  of  response  [7,8]-  However,  at  the 
time  of  clinical  studies,  such  predictors  are 
either  hypothetical  or  we  need  another  clinical 
trials  of  other  drugs  affecting  the  same  target 
to  reach  a  validated  predictor. 

To  address  these  issues,  experts  have 
recommended  utilizing  phase  II  trial  analysis  to 
identify  the  molecular  markers  distinguishing 
responders  and  non  responders  for 
subsequent  planning  of  randomized  trials 
evaluating  INDs  [9,10].  It  should  however  be 
remembered  that  by  the  time  the  we  reach 
phase  II  level,  approximately  billion  dollars  and 
years  of  research  has  been  invested  [11]. 
Considering  the  slow  pace  of  clinical  trials,  the 
high  rates  of  compound  attrition  late  in  clinical 
phase,  the  relatively  small  number  of  patients 
available  for  trials  and  the  finite  R&D  budgets 
of  the  biopharmaceutical  industry,  solving  the 
challenges  of  personalized  therapies  requires 
low-cost,  preferably  laboratory  based  model 
systems  to  test  an  IND’s  efficacy  and  identify 
and  validate  molecular  predictors  of  response 
even  before  we  embark  upon  the  clinical  trials. 
Fortified  with  such  a  data,  we  would  be  able  to 
design  our  clinical  trials  with  greater 
confidence  and  save  enormous  amount  of 
time,  money  and  patient  resources.  We  believe 
that  a  potential  solution  lies  in  the  innovative 
use  of  mouse  models  of  human  cancer  with  an 
aim  to  facilitate  the  identification  of  the 
efficacious  drugs,  as  well  as  predictors  of 
response.  However,  to  ferret  out  the  most 
appropriate  mouse  model  from  amongst  the 
existing  ones,  it  is  imperative  to  understand 
the  advantages  and  pitfalls  of  available  mouse 
models. 

The  recent  unraveling  of  complete  mouse 
genome  sequence  has  strengthened  our 
capability  to  study  the  parallels  and  contrasts 
between  the  pharmacology  of  drug  stability, 
metabolism  and  action  between  mouse  and 
human  studies  and  have  thus  reinvigorated 
our  interests  in  mice  models  [12].  While  the 
development  of  genetically  engineered  mouse 
models  has  contributed  greatly  towards 
understanding  the  process  of  carcinogenesis 
and  target  selection,  the  xenograft  models, 


established  by  injecting  0. 5-1.0  million 
cultured  cancer  cells  subcutaneously  in  a  nude 
/  SCID  mice,  have  been  more  popular  for  drug 
screening  purposes  due  to  its  ease,  low  cost, 
and  faster  establishment.  Almost  every 
successful  cancer  therapy  developed  in  the 
modern  era  has  undergone  xenograft  testing, 
however  many  agents  that  show  consistent 
and  potent  anticancer  efficacy  in  xenografts, 
fail  in  the  clinical  trials  due  to  lack  of  efficacy. 
This  might  be  due  to  reliance  of  xenografts  on 
small  numbers  of  homogeneous  cell  lines 
adapted  to  the  artificial  culture  conditions  and 
acquisition  of  biological  characteristics 
significantly  different  from  the  original  natural 
clone  over  serial  passages  in  culture. 

In  an  attempt  to  circumvent  this  flaw, 
investigators  have  tried  transplanting  fresh 
cells  or  tissues  obtained  directly  from  the 
cancer  patients  into  the  nude  /  SCID  mice, 
called  heterotransplants.  This  approach  has 
demonstrated  superior  correlation  of 
chemosensitivity  and  specificity  data  for 
individualized  therapy  [13-18].  Amongst  these 
approaches,  studies  where  intact  tissue  from 
the  patient  was  transplanted  into  the  mice 
have  shown  the  excellent  patient  response 
prediction  rates  of  90%  and  97%  for 
chemosensitivity  and  chemoresistance, 
respectively  [19].  We  have  previously  reported 
the  response  rate  of  21%  (95%  Cl,  9-38%)  to 
paclitaxel  in  a  series  of  34  NSCLC 
heterotransplants,  comparable  to  the 
response  rates  observed  in  chemotherapy- 
naive  NSCLC  patients  [20].  As  the  original 
microenvironment  of  the  human  tumor  is 
retained,  the  crucial  interplay  of  human 
stroma  -  neoplastic  components  is 
recapitulated  to  a  greater  extent  in 
heterotransplants.  This  contributes  to  better 
replication  of  pharmacogenomic  profiles, 
histology,  chromosome  complement,  antigen 
expression,  and  gene  expression  of  human 
tumors  [19-21],  accounting  for  their  better 
preclinical  predictive  value  as  compared  to  cell 
line  based  xenografts.  Therefore  out  of  the 
available  models,  heterotransplants  seem 
closest  to  an  ‘ideal’  model  to  establish  a  novel 
platform  for  Systematic  Preclinical  Efficacy 
Evaluation  of  Drugs  (SPEED). 

The  SPEED  approach  is  designed  to  address 
the  fundamental  uncoupling  or  disconnection 
between  the  bench  scientist,  who  may  not 
understand  the  intricacies  of  clinical  efficacy 
trials,  and  the  clinical  scientist,  who  may  find  it 
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CTRL  Arm  :  50  Pts 


Tt  Arm  :  50  Pts 
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50  mouse  models 


Figure  1:  A  mouse  cohort  prepared  from  50  models  for  a  specific  malignancy  would  be  equivalent  to  100 
patients  enrolled  in  a  clinical  trial.  Both  test  (Tt)  and  controls  (CTRL)  arms  of  the  mouse  cohort  can  be 
generated  readily  at  a  fraction  of  cost  and  time  compared  to  clinical  trials.  The  two  arms  of  mouse  cohort  are 
adequately  matched  with  regard  to  various  biological  properties. 


difficult  to  make  calculated  translation  of 
bench  side  discoveries.  For  SPEED,  a 
reasonable  number  (close  to  the  number 
needed  for  clinical  phase  II  studies)  of  well 
characterized  and  validated  heterotransplant 
mouse  models  would  be  established  to  make 
perpetual  cohorts  comprising  of  various 
histologies  (Figure  1).  These  cohorts  would  be 
used  to  carry  out  preclinical  IND  efficacy 
studies  in  a  systematic  manner  (replicating 
clinical  phase  II  trials)  to  gauge  an  agent’s 
possible  clinical  efficacy  and  identify  the 
molecular  markers  of  response. 

We  must  understand  that  as  of  now,  the 
efficacy  criteria  used  to  advance  an  agent  in 
preclinical  studies  are  different  from  those  in 
the  clinical  setting.  For  example,  the  NCI 
criterion  for  assessing  a  drug  response  in  mice 
models  is  58%  inhibition  of  tumor  growth.  In  a 
clinical  trial,  however,  this  would  define  a 
Progressive  Disease.  Alternatively,  animals  are 
treated  immediately  after  transplantation, 
before  the  development  of  overt  tumors;  in 
essence,  studying  a  form  of  chemoprevention 


which  is  of  limited  therapeutic  relevance. 
Therefore  the  efficacy  results  of  mice  studies 
frequently  do  not  translate  into  standard 
clinical  results. 

Moreover,  these  preclinical  efficacy  results, 
usually  obtained  in  a  limited  number  of  mice, 
are  often  considered  exciting  enough  to 
initiate  pivotal  clinical  trials,  not  surprisingly 
leading  to  more  than  90%  failure  rates.  Until 
we  systematically  test  the  drugs  at  preclinical 
stage  within  a  framework  of  convincing 
statistical  power/  design,  we  might  not  emerge 
of  the  current  situation  where  success  in  drug 
development  depends  more  upon  chance 
rather  than  scientific  and  robust  foundation.  In 
this  regard,  The  SPEED  platform  provides  us 
an  opportunity  to  apply  the  principles  of 
biostatistics  to  meticulously  design  the 
preclinical  studies  in  a  systematic  manner.  The 
required  number  of  models  according  to  the 
power  of  study  can  be  created  and  not  only  the 
clinical  criteria  of  ‘response’  but  also  the 
‘response  rates’  can  be  measured  in  the 
mouse  cohort  to  gauge  the  clinical  efficacy  of 
the  INDs.  A  careful  sample  size  calculation 
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Figure  2:  A  sample  of  fresh  tumors  are  obtained  from  lung  cancer  patients  at  the  time  of  surgery  and 
transplanted  subcutaneously  into  4-  6  weeks  old  female  nu+/nii+  mice  under  anesthesia  by  a  small  incision 
in  bilateral  axillary  area.  On  reaching  a  size  of  1,500  mm3,  the  tumors  are  harvested  and  propagated  further 
in  2nd  batch  of  mice.  After  these  tumors  are  established,  tumor  tissue  is  harvested,  cryo  preserved,  and  can 
be  used  to  prepare  mouse  cohorts  as  and  when  required.  Multiple  cohorts  may  be  generated  simultaneously 
to  test  various  drugs,  doses,  or  combinations. 


should  help  us  achieve  the  scientific  objectives 
and  extract  information  from  the  preclinical 
data  to  translate  into  clinical  setting. 

As  a  first  step  in  this  regard,  we  have  initiated 
to  establish  a  cohort  of  heterotransplant 
models  from  primary  non  small  cell  lung 
cancer  (NSCLC)  tumors  of  unrelated  patients. 
The  tumors  are  propagated  in  mice  for  limited 
number  of  passages  to  retain  the  human 
components  of  the  tumor  as  much  as  possible. 
Because  tumor  pieces  can  be  cryopreserved 
and  re-implanted  with  high  success  rates  as 
and  when  required  [21],  each  model  will  allow 
expansion  of  early  passage  materials  for 
hundreds  of  mice  tests  (Figure  2).  By  enabling 
us  to  collect  data  on  modulations  in  biology, 
molecular  profiles,  and  response  to  various 
interventions  in  these  close-  to-  human  mouse 
models,  SPEED  approach  has  significant 
potential  for  anticancer  research. 

SPEED  approach  can  be  utilized  to  discover 
molecular  markers  of  response  and  evaluate 
targeted  agents  for  suitability  to  modern 
personalized  clinical  trials.  In  numerous  cases 


the  excellent  preclinical  results  with  INDs  do 
not  translate  in  the  clinical  practice  because 
we  do  not  have  guiding  examples  of  where  a 
class  of  agents  would  be  most  effective.  Most 
trials  of  targeted  agents  are  still  conducted 
empirically  and  Tail’  because  many 
participants  did  not  express  the  target  or  the 
molecular  predictor(s)  of  response  [6,22].  In 
this  respect,  SPEED  approach  can  guide  us  by 
providing  preclinical  information  to  enrich  trial 
cohorts  by  selecting  appropriate  patients 
expressing  molecular  signatures  predictive  of 
response  and  bring  on  a  paradigm  shift  from 
the  current  template  approach  to  the  cutting 
edge  personalized  approach  in  clinical  trials. 

SPEED  approach  can  address  many  other 
clinical  questions.  For,  example,  there  are 
questions  about  the  prognostic  value  of  costly 
or  invasive  post-therapy  analysis  of  residual 
tumors,  which  can  be  addressed  using 
comparable  mouse  models  exposed  to  the 
same  therapy.  We  can  establish  a  mouse 
cohort  from  patients  with  residual  or 
progressive/  metastatic  tumors  to  define  the 
molecular/  genetic  determinants  of  response 
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Figure  3:  SPEED  Consortium  proposes  the  key  stakeholders  including  academia,  industry,  government  bodies, 
and  philanthropic  groups  to  join  hands  and  leverage  all  resources  (financial,  data  and  information,  scientific 
expertise)  to  expedite  anticancer  drug  development  process. 


or  resistance.  Yet  another  key  potential  role  of 
these  cohorts  is  to  evaluate  for  differences  in 
the  molecular  signatures  of  tumor  cells  and 
stroma  cells  to  delineate  the  patterns  of  host- 
tumor  microenvironment  that  occur  during 
cancer  regression  or  progression.  These 
cohorts  can  be  used  to  test  new  delivery 
systems,  including  those  enabled  by  recent 
advances  in  nanotechnology.  In  addition, 
these  cohorts  could  aid  in  development  of 
novel  imaging  agents,  mathematical  modeling, 
and  data  reconstruction  and  visualization  to 
address  the  questions  about  human  cancers. 

A  core  constituent  of  SPEED  would  be  the 
central  database  housing  histopathologic, 
genetic,  expression  profiling,  methylation 
profiling,  proteomic  and  other  biologic  data 
about  heterotransplant  tumors  (in  early, 
intermediate,  and  later  passages)  and  their 
human  counterparts  as  an  integrated  source 
of  information.  Its  later  version  would  include 
information  about  the  SPEED  results  testing 
therapeutic  agents  and  experimental  protocols 
and  may  have  free  public  access  for 
widespread  utilization. 


There  are  some  caveats  and  biases  of 
heterotransplant  models  to  be  considered 
during  the  drug  development  approaches. 
Heterotransplants  usually  involve  non¬ 
orthotopic  tumors  and  might  select  out 
angiogenic  clones  capable  of  sustained  tumor 
growth  after  transplantation.  It  may  not  fully 
capture  the  genetic  diversity  of  metastatic 
disease,  a  major  biological  issue  challenging 
the  efficacy  of  targeted  therapeutics  in  the 
deadly  solid  tumors.  Also,  in  our  own 
experience  only  approximately  50% 
heterotransplanted  human  tumors  actually 
achieve  engraftment  in  mice.  Therefore  these 
models  may  provide  efficacy  data  only  against 
those  tumors  growing  in  mice.  As  the  host 
(SCID  and  nude)  mice  have  profound  defects 
in  their  immune  response,  it  would  preclude 
the  testing  of  immunomodulatory  agents  in  the 
system.  Also  because  SCID  mice  show  defects 
in  DNA  repair  (which  could  limit  testing  of 
some  cytotoxics)  and  nude  mice  show  an 
overall  frailty,  it  can  limit  their  capacity  to 
tolerate  novel  treatments.  There  may  be 
further  limitations  of  heterotransplant  models 
but  we  should  remember  that  in  view  of  more 
than  90%  drug  failure  rates,  it  is  imperative  to 
explore  novel  approach  to  use  existing  models 
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while  continuing  our  search  for  the  better 
models.  Till  the  time  we  find  one,  the  SPEED 
approach  may  well  be  based  on 
heterotransplant  mouse  models.  Future 
clinical  trials  based  on  this  approach  would 
guide  us  to  refine  this  preclinical  strategy 
further  and  the  knowledge  build  up  would  help 
in  the  evolution  of  a  more  scientific  and  robust 
foundation  of  IND  development  process. 

Future  Directions 

The  scientific  scope  of  SPEED  may  be 
expanded  to  include  other  malignancies 
through  concerted  efforts  of  interested  groups 
and  stakeholders  to  institute  a  consortium 
(Figure  3).  The  primary  goal  of  the  SPEED 
consortium  would  be  to  provide  accurate, 
faithful,  and  reproducible  mouse  model 
cohorts  to  the  research  community  for  further 
investigation  and  explorations.  SPEED 
consortium  through  development  of  promising 
agents  and  biomarkers  could  also  guide 
federal  regulatory  bodies  on  their  use  and 
quality  control  for  cancer  treatment.  Moreover, 
the  consortium  illustrates  the  exceptional 
potential  for  development  of  evidence  to 
improve  understanding  of  the  biology  of 
cancer. 

In  conclusion,  the  high  attrition  rates  compel 
us  to  revisit  the  science,  strategy  and 
processes  currently  used  in  drug  development. 
There  is  a  need  for  scientific  and  technological 
innovations  to  obtain  early  readouts  for  proof 
of  concept  to  decrease  drug  attrition  rates  at  a 
later  stage.  In  this  regard  SPEED  paradigm, 
fostering  meticulous  statistical  modus 
operandi,  provides  a  perfect  launch  pad  to  the 
mouse  models  to  leap  into  an  exciting  new  era 
of  drug  development  and  provides  a  robust 
keystone  to  decrease  current  drug  attrition 
rates  in  clinical  trials. 
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Correlation  between  VEGF/VEGFR2  and  EGFR  immunohistochemical 


protein  expression  in  early  stage  non-small  cell  lung  carcinoma. 

Erminia  Massarelli,  Maria  L.  Prudkin,  Natalie  Ozburn,  Lei  Feng,  Guosheng  Yin, 

Waun  Ki  Hong,  Michael  O’Reilly,  Roy  S.  Herbst,  Ignacio  I.  Wistuba 

Several  lines  of  evidence  indicate  that  activation  of  Vascular  Endothelial  Growth  Factor 
Receptor  (VEGFR)  and  Epidermal  Growth  Factor  Receptor  (EGFR)  pathways  are  critical 
for  non-small  cell  lung  cancer  (NSCLC)  development,  growth  and  progression.  Despite 
the  availability  of  new  biological  compounds  targeting  both  signaling  pathways  in 
NSCLC,  data  about  the  co-expression  of  these  two  pathways  in  a  large  cohort  of  NSCLC 
tumor  samples  are  not  available.  To  better  understand  the  correlation  between  VEGFR 
and  EGFR  pathways  expression,  we  investigated  in  a  large  series  of  surgically  resected 
NSCLC  tumor  tissues  placed  in  tissue  microarrays  the  tumor  immunohistochemical 
(IHC)  protein  expression  of:  VEGF-A,  VEGF-R2,  p-VEGF-R2  (phosphorylated  VEGF- 
R2),  EGFR  and  p-EGFR  (phosphorylated  EGFR).  Correlations  between  VEGF-A, 
VEGFR2,  p-VEGFR2  and  clinicopathologic  information  and  survival  analysis  were 
examined.  In  a  subset  of  NSCLSs  with  adenocarcinoma  histology,  mutation  status  of 
EGFR  and  KRAS  genes  was  correlated  with  IHC  expression.  Two  hundred  eighty-four 
surgically  resected  tumors,  including  179  adenocarcinomas  and  105  squamous  cell 
carcinomas  from  patients  with  stage  I-II-IIIA  NSCLC  were  studied.  A  semi-quantitative 
analysis  of  nuclear,  cytoplasmic  and  membranous  localization  of  IHC  expression  was 
performed  for  each  marker.  Lung  adenocarcinomas  demonstrated  higher  expression  of 
cytoplasmic  VEGF-A  (p=0.0001),  membranous  (p=0.005)  and  cytoplasmic  (p=0.03) 
VEGF-R2,  and  membranous  p-VEGF-R2  (p=0.0002)  compared  to  squamous  cell 
carcinomas.  Lower  VEGF-A  (p=0.0009),  membranous  (p=0.01)  and  cytoplasmic  VEGF- 
R2  (p=0.02)  expression  was  statistically  associated  with  non-smoking  history.  With  a 
median  follow  up  of  4.28  years,  91  deaths  occurred.  Independent  of  age,  histology  and 
stage,  cytoplasmic  p-VEGF-R2  expression  was  found  to  have  a  prognostic  role  for  worse 
overall  survival  (p=0.01,  HR=  1.047,  95%  0=1.01,  1.085).  Of  interest,  significant 
increase  of  cytoplasmic  and  membranous  p-EGFR  expression  was  detected  in  tumors 
showing  higher  levels  of  cytoplasmic  VEGF-A  (p=0.01  and  p=0.0001,  respectively), 
VEGF-R2  (p=0.0001  and  p=0.0001),  and  p-VEGF-R2  (p=0.0001  and  p=0.0003).  In  the 
15  EGFR- mutated  cases  a  significant  lower  membranous  VEGF-R2  expression  was 
observed  (p=0.02).  Higher  membranous  VEGF-R2  expression  was  observed  in  the  9 
KRAS  mutated  cases,  but  this  did  not  reach  statistical  significance.  In  summary,  our 
findings  indicate  that  VEGFR  and  EGFR  pathways  are  positively  correlated  in  early  stage 
NSCLC  and  IHC  expression  of  p-VEGF-R2  is  an  indicator  of  worse  overall  survival  in 
stage  I-IIIA  NSCLC  (Grant  IMPACT  W81XWH-05-0027). 
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Statement  of  Clinical  Relevance. 


A  better  understanding  of  the  signaling  pathways  that  lead  to  tumor  growth  may  help  in  the 
development  of  new  and  more  effective  strategies  for  targeted  chemoprevention  and  treatment  of 
lung  cancer.  Our  finding  of  frequent  overexpression  of  ER  and  PR  in  NSCLC  suggests  that  the 
activation  of  these  pathways  is  an  attractive  novel  target  for  lung  cancer  chemopreventive  and 
therapeutic  strategies.  The  correlation  between  ER  and  EGFR  mutation  in  lung  adenocarcinoma 
suggests  that  it  might  be  important  to  target  both  pathways  simultaneously  for  lung  cancer 
therapy. 
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ABSTRACT 


Purpose:  To  determine  the  frequency  of  estrogen  receptors  (ER)  a  and  (3  and  progesterone 
receptor  (PR)  immunohistochemical  expression  in  a  large  set  of  NSCLC  specimens,  and  to 
compare  our  results  with  those  for  some  of  the  same  antibodies  that  have  provided  inconsistent 
results  in  previously  published  reports. 

Experimental  Design:  Using  multiple  antibodies,  we  investigated  the  immunohistochemical 
expression  of  ERa  and  (3  and  PR  in  317  NSCLCs  placed  in  tissue  microarrays  and  correlated 
their  expression  with  patients’  clinicopathologic  characteristics,  and  in  adenocarcinomas,  with 
EGFR  mutation  status. 

Results:  ERa  and  (3  were  detected  in  the  nucleus  and  cytoplasm  of  NSCLC  cells;  however,  the 
frequency  of  expression  (nucleus:  a,  5%-36%,  and  (3,  42%-56%;  cytoplasm:  a,  <1% — 42%,  and 
(3,  20%-98%)  varied  among  the  different  antibodies  tested.  PR  was  expressed  in  63%  of  tumor 
cell  nuclei.  ERa  nuclear  expression  significantly  correlated  with  adenocarcinoma  histology, 
female  gender,  and  history  of  never  smoking  (P  =  0.0048  to  <0.0001).  In  NSCLC,  higher 
cytoplasmic  ERa  expression  significantly  correlated  with  worse  RFS  (HR  1.77,  95%  Cl,  1.12, 
2.82;  P  =  0.015)  in  multivariate  analysis.  In  adenocarcinomas,  ERa  expression  correlated  with 
EGFR  mutation  ( P  =  0.0029  to  <0.0001).  ER(3  and  PR,  but  not  ERa,  expressed  in  the  normal 
epithelium  adjacent  to  lung  adenocarcinomas. 

Conclusions:  Both  ER  antibodies  and  PR  are  frequently  expressed  in  NSCLC.  ERa  expression 
distinguishes  a  NSCLC  subset  that  has  defined  clinicopathologic  and  genetic  features.  The 
correlation  between  ER  and  EGFR  mutation  in  lung  adenocarcinoma  suggests  that  it  might  be 
important  to  target  both  pathways  simultaneously  in  lung  cancer  therapy. 
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INTRODUCTION 


Lung  cancer  is  the  most  common  cause  of  cancer  mortality  worldwide,  with  over  1 
million  deaths  each  year  (1).  Lung  cancer  includes  several  histological  types,  the  most  frequently 
occurring  of  which  are  two  types  of  non-small  cell  lung  carcinoma  (NSCLC):  adenocarcinoma 
and  squamous  cell  carcinoma  (2).  During  the  last  two  decades,  mortality  rates  associated  with 
cancer  have  continued  to  decrease  across  all  major  sites  in  both  men  and  women;  however,  the 
rates  for  lung  cancer  in  females  have  continued  to  increase  (3,  4).  Despite  global  statistics 
estimating  that  15%  of  lung  cancer  in  men  and  53%  in  women  are  not  attributable  to  smoking 
(1),  smoking  remains  the  primary  risk  factor  for  lung  cancer.  The  higher  proportion  of  lung 
cancer  in  females  who  have  never  smoked  compared  with  males  who  have  never  smoked 
suggests  a  possible  role  for  gender-dependent  hormones  in  the  development  of  lung  cancer  (5). 

Estrogen  receptors  (ER)  a  and  (3  are  expressed  in  normal  lung  tissue  and  in  lung  tumors 
in  both  men  and  women  (6),  yet  the  data  are  inconsistent  as  to  whether  ER  expression  is  gender 
biased  (6-9)  or  associated  with  NSCLC  overall  survival  (9-1 1).  The  data  reported  on  the 
immunohistochemical  expression  for  both  ER  receptors  in  NSCLC  remain  controversial.  ERa 
has  been  reported  to  be  expressed  in  the  nucleus  (0%-45%)  and  cytoplasm  (0%-73%)  of 
malignant  lung  cancer  cells  in  the  cases  examined  (9,  10,  12,  13).  The  percentages  for  ER(3  are 
more  consistent,  with  46%-60%  of  NSCLC  cases  showing  only  nuclear  expression  (9-14). 
Similarly,  two  reports  suggested  that  progesterone  receptor  (PR)  is  frequently  (47%)  expressed 
in  NSCLC  tumor  cells,  and  this  expression  correlated  with  better  patient  outcome  (12,  15). 

Several  in  vitro  and  in  vivo  studies  have  provided  evidence  supporting  a  biological  role 
for  estrogens  in  lung  carcinogenesis  by  direct  promotion  of  cell  proliferation — estrogens 
stimulate  the  proliferation  of  NSCLC  cells  through  estrogen  receptor-mediated  signaling, 
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whereas  anti-estrogens  inhibit  the  growth  of  NSCLC  cells  (6,1, 13,  16,  17).  Estrogen  can 
directly  stimulate  the  transcription  of  estrogen-responsive  genes  in  the  nucleus  of  lung  cells  and 
can  also  transactivate  growth  factor- signaling  pathways — the  epidermal  growth  factor  receptor 
(EGFR)  pathway,  in  particular  (13,  18).  In  estrogen  stimulation  of  lung  cancer  cells,  EGFR 
ligands  are  rapidly  released,  activating  the  EGFR  and  mitogen-activated  protein  kinase  1 
(MAPK1)  growth  pathways  (19).  Activation  of  the  EGFR  pathway  appears  to  play  an  important 
role  in  the  pathogenesis  and  progression  of  NSCLC  (20).  In  lung  cancer  cells,  the  constitutive 
activation  of  EGFR  is  achieved  by  several  mechanisms,  including  increased  production  of 
ligands,  increased  levels  of  the  receptor,  and  mutation  of  the  EGFR  tyrosine  kinase  domain  (20- 
22).  Of  interest,  EGFR  protein  expression  is  downregulated  in  response  to  estrogens  and 
upregulated  in  response  to  anti-estrogens,  suggesting  that  a  reciprocal  control  mechanism  exists 
between  the  EGFR  and  ER  pathways  (19). 

The  purpose  of  the  current  study  was  to  determine  the  frequency  of  ERa  and  (3  and  PR 
protein  immunohistochemical  expression  in  a  large  set  of  NSCLCs  placed  in  tissue  microarray 
(TMA)  specimens  and  to  compare  our  results  with  those  for  some  of  the  same  antibodies  that 
have  provided  inconsistent  results  in  previously  published  reports  (9-14).  In  addition,  the 
receptor-expression  results  were  correlated  with  patients’  clinicopathologic  features,  including 
NSCLC  histology,  gender,  smoking  history,  and  patient  outcome,  and  in  adenocarcinoma  with 
tumors’  EGFR  activating  mutation  status.  Finally,  to  understand  ERa  and  (3  and  PR  protein 
expression  role  in  the  early  pathogenesis  of  lung  cancer,  we  investigated  the  characteristics  of 
ERa  and  (3  and  PR  protein  expression  in  the  non-malignant  respiratory  epithelium  adjacent  to 
tumors  taken  from  a  subset  of  our  retrospectively  reviewed  lung  adenocarcinoma  cases. 
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MATERIALS  AND  METHODS 


Case  selection  and  TMA  construction.  We  obtained  archived,  formalin- fixed,  paraffin- 
embedded  (FFPE)  tissue  from  surgically  resected  (with  curative  intent)  lung  cancer  specimens 
(lobectomies  and  pneumonectomies)  containing  tumor  and  adjacent  normal  epithelium  tissues 
from  the  Lung  Cancer  Specialized  Program  of  Research  Excellence  Tissue  Bank  at  The 
University  of  Texas  M.  D.  Anderson  Cancer  Center  (Houston,  TX),  which  has  been  approved  by 
the  institutional  review  board.  The  tissue  had  been  collected  from  1997  to  2001,  and  the  tissue 
specimens  were  histologically  examined  and  classified  using  the  2004  World  Health 
Organization  classification  system  (2).  We  selected  317  NSCLC  tissue  samples  (201 
adenocarcinomas  and  116  squamous  cell  carcinomas)  for  our  TMAs.  TMAs  were  constructed 
using  triplicate  1-mm  diameter  cores  per  tumor,  and  each  core  included  central,  intermediate,  and 
peripheral  tumor  tissue.  Detailed  clinical  and  pathologic  information,  including  demographics, 
smoking  history  (never-  and  ever-smokers),  and  smoking  status  (never,  former,  and  current), 
clinical  and  pathologic  tumor-node-metastasis  (TNM)  stage,  overall  survival  (OS)  duration,  and 
time  to  recurrence  was  available  for  most  cases  (Supplementary  Table  1).  Patients  who  had 
smoked  at  least  100  cigarettes  in  their  lifetime  were  defined  as  smokers,  and  smokers  who  quit 
smoking  at  least  12  months  before  their  lung  cancer  diagnosis  were  defined  as  former  smokers. 
Tumors  were  pathologic  TNM  stages  I-IV  according  to  the  revised  International  System  for 
Staging  Lung  Cancer  (23). 

To  assess  the  immunohistochemical  expression  of  ERa  and  (3  and  PR  markers  in  the  non- 
malignant  respiratory  epithelium  adjacent  to  lung  tumors,  we  selected  whole  histology  sections 
containing  tumor  and  adjacent  lung  tissue  from  64  adenocarcinomas  that  were  included  in  our 
TMAs. 
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Immunohistochemical  staining  and  evaluation.  The  following  antibodies  against  ERa 
and  (3  and  PR  were  purchased:  1)  ERa-1,  clone  6F1 1,  Novocastra,  Leica  Microsystems  Inc. 
(Bannockburn,  IL);  2)  ERot-2,  clone  6F11,  Chemicon,  Millipore  Corporate  (Billerica,  MA);  3) 
ERa-3,  clone  HC20,  Santa  Cruz  Biotechnology  Inc.  (Santa  Cruz,  CA);  4)  ERa-4,  clone  1D5, 
Lab  Vision  Corporation  (Fremont,  CA);  5)  ER(3-1,  clone  HI 50,  Santa  Cruz  Biotechnology 
(Santa  Cruz,  CA);  2)  ER[3-2,  clone  14C8,  GeneTex  Inc  (San  Antonio,  TX);  and  6)  PR,  clone 
SP2,  Lab  Vision  Corporation  (Fremont,  CA).  Details  on  immunohistochemistry  conditions  and 
characteristics  of  the  antibodies  are  listed  in  Supplementary  Table  2.  Immunohistochemical 
staining  was  performed  as  follows:  5-pM  FFPE  tissue  sections  were  deparaffinized,  hydrated, 
heated  in  a  steamer  for  10  minutes  with  10  mM  sodium  citrate  (pH  6.0)  for  antigen  retrieval,  and 
washed  in  Tris  buffer.  Peroxide  blocking  was  done  with  3%  H2O2  in  methanol  at  room 
temperature  for  15  min,  followed  by  10%  fetal  bovine  serum  in  tris-buffered  saline-t  for  30  min. 
The  slides  were  incubated  with  primary  antibody  at  an  ambient  temperature  for  60  min  for  all 
antibodies;  the  exception  was  ER(3  14C8  (ER(3-2),  which  was  incubated  overnight  at  4°C, 
washed  with  phosphate-buffered  saline,  and  incubated  with  biotin-labeled  secondary  antibody 
(Envision  Dual  Link  +,  DAKO,  Carpinteria,  CA)  for  30  min.  Staining  for  the  slides  was 
developed  with  0.05%  3',  3-diaminobenzidine  tetra  hydrochloride,  which  had  been  freshly 
prepared  in  0.05  mol/L  Tris  buffer  at  pH  7.6  containing  0.024%  H2O2,  and  then  the  slides  were 
counterstained  with  hematoxylin,  dehydrated,  and  mounted.  FFPE  normal  breast  tissue  was  used 
as  the  positive  control.  For  the  negative  control,  we  used  the  same  specimens  used  for  the 
positive  controls  but  replaced  the  primary  antibody  with  phosphate-buffered  saline. 

Two  observers  (M.G.R.  and  I.I.W.)  jointly  quantified  the  immunohistochemical 
expression  of  ERs  and  PR  using  light  microscopy  (magnification  20x).  Both  nuclear  and 
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cytoplasmic  expressions  were  quantified  using  a  four-value  intensity  score  (0,  1+,  2+,  and  3+) 
and  the  percentage  (0%  to  100%)  of  reactivity.  We  defined  the  intensity  categories  as  follows:  0 
=  no  appreciable  staining;  1+  =  barely  detectable  staining  in  epithelial  cells  compared  with  the 
stromal  cells;  2+  =  readily  appreciable  staining;  and,  3+  =  dark  brown  staining  of  cells.  Next,  an 
expression  score  was  obtained  by  multiplying  the  intensity  and  reactivity  extension  values 
(range,  0-300). 

EGFR  mutation  analysis.  Exons  18-21  of  EGFR  were  polymerase  chain  reaction  (PCR)- 
amplified  using  intron-based  primers  as  previously  described  (24,  25).  Approximately  200 
microdissected  FFPE  cells  were  used  for  each  PCR  amplification.  All  PCR  products  were 
directly  sequenced  using  the  Applied  Biosystems  PRISM  dye  terminator  cycle  sequencing 
method.  All  sequence  variants  were  confirmed  by  independent  PCR  amplifications  from  at  least 
two  independent  microdissections  and  DNA  extraction,  and  the  variants  were  sequenced  in  both 
directions,  as  previously  reported  (24,  25). 

Statistical  analysis.  The  immunohistochemical  expression  and  clinicopathologic  data 
were  summarized  using  standard  descriptive  statistics  and  frequency  tabulations.  BFiP  plots 
were  generated  to  summarize  the  distribution  of  ER  and  PR  expressions.  Associations  between 
the  marker  expression  and  patients’  clinical  and  demographical  variables  (including  age,  sex, 
smoking  history,  histology  type,  and  pathologic  stage)  were  assessed  using  appropriate  methods 
including  the  chi-square  or  Fisher  exact  test  for  categorical  variables,  and  Wilcoxon  rank  sum  or 
Kruskal-Wallis  test  for  continuous  variables.  The  Spearman  rank  correlation  coefficient  was  used 
to  estimate  the  correlation  between  immunohistochemistry  markers.  Kaplan-Meier  survival 
curves  for  patient  OS  and  RFS  were  also  generated.  The  log-rank  test  was  used  to  identify  the 
difference  between  the  patient  groups  for  both  overall  and  RFS.  For  univariate  and  multivariate 
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analyses  for  immunohistochemical  expressions,  the  Cox  proportional  hazard  model  was  used. 
Two-sided  P  values  less  than  0.05  were  considered  statistically  significant. 

RESULTS 

Correlation  of  expression  of  ER  antibodies.  We  examined  four  commercially  available 
antibodies  against  ERa:  two  using  the  same  clone  (6F1 1)  and  two  antibodies  against 
ER(3  (Supplementary  Table  2).  Using  the  scores  of  expression  generated  from  all  NSCLCs  we 
analyzed  the  correlation  of  the  expression  in  the  malignant  cells  for  the  four  ERa  and  the  two 
ER(3  antibodies  tested.  All  four  of  the  ERa  antibodies  showed  nuclear  staining,  and  two  of  the 
four  antibodies  also  detected  expression  in  the  cytoplasm  of  malignant  cells  (ERa-3,  clone 
HC20,  and  ERa-4,  clone  1D5).  The  two  ERa  clone  6F1 1  antibodies  (ERa-1  and  ERa-2), 
obtained  from  two  different  companies,  demonstrated  only  nuclear  staining.  All  four  of  the 
ERa  antibodies  significantly  correlated  with  each  other  at  nuclear  expression  (Spearman  rank 
correlation:  r  =  0.32  to  0.48;  P  <  0.0001;  Supplementary  Table  3).  Also,  significant  correlation 
was  detected  in  the  staining  of  the  two  ERa  antibodies,  showing  cytoplasmic  expression  (r  = 

0.43  ;  P  <  0.0001).  There  was  no  statistically  significant  correlation  between  both  of  the  ER(3 
antibodies  examined  in  their  nuclear  expression;  although  they  significantly  correlated  at  their 
cytoplasmic  expression,  the  correlation  coefficient  was  very  low  (r  =  0.17;  P  =  0.005). 

Frequency  of  ER  and  PR  expression  in  NSCLC  specimens  by  histology.  We  analyzed 
the  frequency  of  any  ER  and  PR  immunohistochemical  expression  (positive  cases,  score  >0)  for 
each  antibody  tested  by  NSCLC  tumor  histology,  and  the  data  are  summarized  in  Table  1. 
Representative  microphotographs  of  the  expression  of  ER  and  PR  with  some  of  the  antibodies 
tested  are  shown  in  Fig.  1 .  ERs  and  PR  were  detected  in  the  nucleus  of  malignant  cells  by  all  of 
the  corresponding  antibodies  tested.  However,  when  expressed,  the  percentage  of  malignant  cells 
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showing  staining  was  low  in  general,  with  an  average  percentage  of  positive  expression  of:  19% 
(range  2-90%)  for  ERa-1  nuclear;  13%  (range  2-93%)  for  ERa-2  nuclear;  21%  (range  1-60%) 
and  19%  (range  3-73%)  for  ERa-3  nuclear  and  cytoplasmic,  respectively;  and,  11%  (range  3- 
97%)  and  7%  (range  3-30%)  for  ERa-4  nuclear  and  cytoplasmic,  respectively.  The  average 
percentages  of  positive  cells  expressing  ER(3  were  37  %  (range  3-90%)  and  37%  (range  3-97%) 
for  ER(3-1  nuclear  and  cytoplasmic,  respectively;  and,  13  %  (range  1-77)  and  24  %  (range  3-67) 
for  ER|3-2  nuclear  and  cytoplasmic,  respectively. 

Although  there  are  important  variations  in  the  frequency  of  expression  between  the 
nuclear  ERa  antibodies  tested,  adenocarcinoma  histology  showed  significantly  higher  frequency 
of  expression  than  squamous  cell  carcinomas  for  all  ERa  antibodies  (P  <0.0001-0048;  Table  1). 
For  nuclear  expression  of  ER(3,  the  data  obtained  with  both  antibodies  tested  were  relatively 
consistent,  and  the  adenocarcinoma  histology  demonstrated  a  significantly  higher  frequency  of 
expression  than  the  squamous  cell  carcinoma  did  with  the  ERJ3-2  antibody  (P  =  0.0069).  Two  of 
the  ERa  (ERa-3  and  ERa-4)  and  both  ER(3  antibodies  also  detected  ER  expression  in  the 
cytoplasm  of  NSCLC  cells  (Table  1).  While  the  ERJ3-2  antibody  was  expressed  in  the  cytoplasm 
of  a  subset  of  NSCLCs,  the  ER(3- 1  antibody  was  expressed  in  nearly  all  of  the  tumors. 
Cytoplasmic  expression — only  for  the  ERa-3  antibody — was  significantly  higher  in 
adenocarcinomas  when  compared  with  squamous  cell  carcinomas  (P  =  0.0064). 

In  the  NSCLC  tissues,  PR  expression  was  frequently  detected  in  the  nuclei  of  malignant 
cells  only.  Squamous  cell  carcinoma  histology  showed  a  marginally  significant  higher  frequency 
of  expression  than  that  of  the  adenocarcinomas  (P  =  0.05;  Table  1). 

Correlation  between  ER  and  PR  expression  in  NSCLC  and  patients’  clinicopathologic 
features.  We  correlated  expression  of  ERs  and  PR  for  each  antibody  tested  with  the  patients’ 
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clinicopathologic  characteristics,  including  histology,  gender,  tobacco  history,  and  TNM 
pathological  stage  using  the  expression  score  as  a  continuous  variable.  Using  this  type  of 
analysis,  adenocarcinoma  histology  also  showed  a  statistically  significant  higher  nuclear 
expression  for  all  ERa  antibodies  and  for  the  ER(3-2  antibody  than  squamous  histology  (Table 
2).  Of  great  interest  was  the  fact  that  the  NSCLC  tissues  obtained  from  females  and  never 
smokers  demonstrated  statistically  significant  higher  expression  of  nuclear  ERa  and  (1  for 
several  of  the  antibodies  used  (Table  2).  No  correlations  between  the  expression  of  PR  and  the 
clinicopathologic  characteristics  were  found. 

We  performed  OS  and  recurrence-free  survival  (RFS)  analyses  to  determine  the 
expression  of  ERs  and  PR  for  each  antibody  tested  by  using  specimens  from  317  patients  with 
NSCLC  with  a  median  follow-up  of  6.1  years  for  OS  and  4.2  years  for  RFS.  No  association  was 
detected  between  the  expression  of  ER  and  PR  and  OS.  Of  interest,  any  expression  of 
cytoplasmic  ERa,  using  ERa-4  antibody,  and  nuclear  ER(3,  using  the  ER(3-1  antibody,  conferred 
to  patients  a  significantly  worse  RFS  in  the  both  univariate  and  multivariate  analysis  (Fig.  2  and 
Table  3).  However,  only  the  cytoplasmic  expression  of  ERa-4  correlated  with  worse  RFS  when 
dichotomized  score  being  used  (HR  1.77;  95%  Cl,  1.1 1-2.81;  P  =  0.0156;  Table  3). 

Correlation  between  ER  and  PR  expression  in  NSCLC  and  tumor  EGFR  mutation 
status.  Among  182  adenocarcinoma  cases,  EGFR  mutations  of  the  tyrosine  kinase  domain 
(exons  18-21)  were  detected  in  31  (17%)  cases.  Most  (88%)  EGFR  mutations  were  detected  in 
the  exons  19  and  21,  and  we  did  not  find  correlation  between  the  location  of  the  mutation  and  ER 
a  and  (3  expression.  We  correlated  the  ER  and  PR  scores  and  any  expression  (positive  cases, 
score  >0)  with  EGFR  mutation  status.  Interestingly,  EGFR  mutant  adenocarcinomas 
demonstrated  statistically  significant  higher  expression  than  wild-type  tumors  of  nuclear  ERa, 
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cytoplasmic  ERa,  and  nuclear  ER(3  when  tested  with  antibodies  ERa-3,  ERa-4,  and  ER(3-1, 
respectively  (Table  4  and  Fig.  3).  Because  there  was  a  higher  incidence  of  EGFR  mutation  in 
lung  adenocarcinoma  cases  from  patients  with  a  history  of  never  smoking,  Asian  ethnicity,  or 
female  characteristics  (data  not  shown),  we  adjusted  the  effects  of  age,  gender,  smoking  history, 
ethnicity,  and  pathological  stage  in  the  correlation  of  ERa  and  (3  with  EGFR  mutation  status. 
After  linear  regression  analysis,  all  the  significant  correlations  remained  statistical  significant. 
There  was  no  correlation  between  PR  expression  and  EGFR  mutation  status. 

ER  and  PR  immunohistochemical  expression  in  the  lung  respiratory  airway  adjacent 
to  adenocarcinoma  cases.  To  characterize  the  pattern  of  expression  of  ER  and  PR  in  the 
respiratory  airway  field  in  patients  with  lung  cancer,  we  selected  64  adenocarcinoma  cases  (35 
females  and  29  males;  19  never  smoked,  13  current  smokers,  and  32  former  smokers),  and  we 
studied  the  immunohistochemical  expression  of  ERa  and  (3  and  PR  in  the  respiratory  cells  lining 
the  small  bronchi  (n  =  35  cases),  bronchioles  (n  =  83  cases),  and  alveoli  exhibiting  Type  II  cells 
hyperplastic  changes  (n  =  15  cases)  using  the  same  semi  quantitative  scoring  system  used  in  the 
TMAs.  For  ERs,  we  tested  the  ERa-4  and  ER(3-1  antibodies.  From  each  case,  we  used 
immunohistochemistry  to  examine  whole  tissue  sections  from  a  mean  of  three  different  paraffin 
blocks  (range  3-6)  containing  tumor  and  adjacent  normal  lung  tissue.  We  found  that  ERa  was 
not  expressed  in  the  airway  epithelium  adjacent  to  lung  adenocarcinomas,  including  epithelial 
samples  from  21  positive  tumors  (Supplementary  Table  4).  In  contrast,  ER(3  was  widely 
expressed  in  the  cytoplasm  of  respiratory  cells:  91%  of  bronchi,  84%  of  bronchioles,  and  29%  of 
the  hyperplastic  alveoli.  ER[3  nuclear  immunostaining  was  found  less  frequently:  5%  of  bronchi, 
10%  of  bronchioles,  and  none  of  the  hyperplastic  alveolar  cells.  Noticeably,  in  the  bronchial 
cells,  we  identified  two  patterns  of  cytoplasmic  immunostaining:  a  homogeneous  staining  in  all 
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types  of  bronchial  cells  and  heterogeneous  staining  comprising  only  ciliated  cells  with  mainly 
supranuclear  or  apical  expression  (Fig.  1).  PR  was  found  in  the  nucleus  of  56%  of  bronchi,  61% 
of  bronchioles,  and  33%  of  hyperplastic  alveoli.  Of  interest,  there  was  a  high  level  of  correlation 
(28/33  comparisons,  85%)  between  the  expression  of  PR  in  the  normal  epithelium  and  the 
corresponding  tumors.  Twenty  (95%)  out  of  21  cases  with  PR  positive  in  the  normal  epithelium 
were  detected  in  patients  with  tumors  that  also  expressed  this  receptor. 

DISCUSION 

ERa  and  (3  frequently  expressed  in  our  NSCLC  cases,  and  ERa  expression  distinguished 
a  subset  of  NSCLC  that  has  defined  clinicopathologic  and  genetic  features.  Although  the 
immunohistochemical  expression  of  ERa  and  (3  has  been  reported  in  tumor  tissue  specimens 
from  surgically  resected  NSCLCs,  the  data  on  the  fraction  of  tumors  expressing  ER  are  still 
controversial.  Previous  studies  on  ERa  immunohistochemical  expression  in  formalin- fixed  and 
paraffin-embedded  NSCLC  specimens  using  seven  different  antibodies  identified  nuclear 
expression  in  malignant  cells  in  frequencies  that  ranged  from  none  (10,  14)  to  18%  (26)  and  38% 
(12).  Similarly,  in  other  studies,  the  frequency  of  ERa  cytoplasmic  expression  in  NSCLC  ranged 
from  0%-3%  (12,  26)  to  35%  (1 1)  and  73%  (27).  In  the  current  study,  using  four  different 
commercially  available  ERa  antibodies,  we  also  identified  a  wide  range  of  percentages  in  the 
frequency  of  NSCLCs  exhibiting  any  expression  of  ERa  in  the  nucleus  (7%-54%)  and  in  the 
cytoplasm  (0%-42%)  of  tumor  cells.  However,  in  our  study,  when  the  scores  of 
immunohistochemical  expression  were  analyzed  as  continuous  variables,  all  of  the 
ERa  antibodies  significantly  correlated  with  each  other  at  nuclear  and  cytoplasmic  locations. 

A  similar  situation  is  observed  when  the  ER(3  immunohistochemical  expression  data  are 
examined  in  NSCLC.  Several  previous  studies,  using  six  different  antibodies,  have  reported 
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frequencies  of  ER(3  expression  in  tumors  with  a  wide  range  of  percentages  at  the  nuclear 
location — 0%  (9),  34%47%  (10,  12,  14),  and  61%-84%  (9,  11)  — but  not  in  the  cytoplasm  of 
malignant  cells,  where  most  of  the  studies  have  shown  no  reactivity  (9,  10,  12,  14);  some 
expression  was  seen  in  a  small  number  of  cases  (6)  or  low  frequency  of  expression  in  a  large 
number  of  cases  (10%)  (1 1).  In  the  present  study  using  two  antibodies,  any  ERp  nuclear 
expression  was  detected  in  about  half  (56%  and  42%)  of  the  NSCLCs,  and  cytoplasmic 
expression  was  found  in  a  wider  range  (20%-98%)  of  our  cases.  We  do  not  have  a  definitive 
explanation  to  the  high  levels  of  expression  of  ERP  in  NSCLC  cells  in  our  study,  and  the 
discordance  with  previous  reports.  However,  immunohistochemical  analysis  has  shown  the 
distribution  of  ERp  to  be  much  more  widespread  than  ERa  (28-30).  Several  studies  have 
reported  that  ERp  immunohistochemical  expression  is  frequently  detected  in  the  nucleus  and 
cytoplasm  of  normal  respiratory  cells  (28).  While  expression  has  been  questioned  by  suggestions 
that  this  observation  is  based  on  non-specific  binding  produced  by  unpurified  antibodies  (31), 
multiple  reports  have  shown  the  presence  of  a  non-nuclear  pool  of  ERs  in  normal  and  malignant 
cells  (32-35).  Yang  et  al  (35)  used  one  of  the  same  ERP  antibodies  that  we  used  (ERP-1)  and 
demonstrated  mitochondrial  localization  of  this  receptor  in  several  normal  human  and  murine 
cells,  suggesting  a  role  for  ERP  receptor  in  the  cytoplasm  of  cells.  Our  finding  of  high  frequency 
of  ERp  expression,  using  ERb-1  antibody,  in  the  cytoplasm  of  normal  respiratory  cells  from  our 
lung  adenocarcinoma  patients  are  consistent  with  these  findings. 

Several  discrepancies  were  observed  when  we  compared  our  results  with  those  published 
previously  (6,  9,  10,  12)  using  the  same  antibodies,  especially  for  ERa.  For  example,  our  ERa-3 
antibody,  raised  against  the  COOH-terminus  region  of  the  protein,  detected  any  nuclear  and 
cytoplasmic  expressions  in  54%  and  42%  of  our  NSCLC  cases,  respectively.  Using  this 
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antibody,  nuclear  expression  was  reported  in  a  small  number  of  NSCLC  tumors  by  Stabile  et  al 
(6)  and  in  none  of  the  130  tumors  examined  by  Kawai  et  al  (10).  At  the  cytoplasmic  location  of 
malignant  cells,  both  studies  reported  positive  immunostaining  (6,  10),  with  up  to  73%  of  cases 
in  the  study  performed  by  Kawai  et  al  (10). 

Why  these  inconsistent  results  on  the  immunohistochemical  expression  of  ER  a  and  (3 
occur  raises  a  very  important  question.  Clearly  the  reasons  for  the  inconsistent  results  include  the 
use  of  different  antibodies  manufactured  from  different  clones  and  by  different  companies. 
Indeed,  some  of  these  antibodies  have  been  made  against  different  parts  of  the  protein:  full 
length,  NH2-terminues,  and  COOH-terminus  regions.  It  has  been  suggested  that  several  mRNA 
splicing  variants  of  ERa  have  been  detected  in  lung  cancer  cell  lines,  and  antibodies  raised 
against  epitopes  in  the  deleted  exons  of  ER  may  give  conflicting  results  (6).  In  addition,  it  is 
important  to  note  that  there  are  multiple  criteria  reported  to  assess  ERa  and  (3  positivity  in 
NSCLC  tissues.  Although  most  studies  considered  different  levels  of  intensity  (usually  a  scale  0- 
3+)  of  expression  at  nuclear  and  cytoplasmic  locations  combined  with  the  percentage  of 
malignant  cells  expressing  a  given  intensity,  the  cutoff  levels  of  expression  vary  significantly 
between  studies  (e.g.,  1+  in  >10%  of  cells;  1+  in  1%— 25%  of  cells;  >50%  of  cells;  score  “0-8”, 
etc)  (6,  9-12,  14,  26). 

Because  there  were  different  levels  of  ERa  and  (3  immunohistochemical  expression 
detected  using  different  antibodies  in  ours  and  the  previous  studies  (6,  9-12,  14,  26),  we 
correlated  the  expression  of  ER  using  all  of  the  antibodies  we  tested  with  the  patients’ 
clinicopathological  features  and  the  tumors’  EGFR  mutation  status.  The  evaluation  of  multiple 
antibodies  for  ER  expression  adds  strength  to  our  findings.  In  our  study,  we  analyzed  the 
immunohistochemical  scores  as  continuous  and  dichotomized  variables,  and  a  significantly 
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higher  expression  of  nuclear  ERa  was  detected  with  all  four  antibodies  tested  in  adenocarcinoma 
than  squamous  cell  carcinoma  histology — three  out  of  the  four  antibodies  tested  in  tumors 
obtained  from  females  compared  with  males  and  from  people  who  had  never  smoked  compared 
with  smokers.  The  two  previous  studies  reporting  ERa  nuclear  expression  in  NSCLC,  which 
examined  a  relatively  large  series  of  cases,  did  not  address  differences  of  expression  based  on 
histology  types  or  patients’  clinicopathologic  features  (12,  26).  In  the  NSCLC  tissues  that  we 
reviewed,  higher  expression  of  ER(3  correlated  significantly  with  tumor  adenocarcinoma 
histology  and  the  patients’  female  gender  for  ER(3-1  antibody,  and  correlated  with  the  patients’ 
history  of  never  smoking  with  the  ER(3-2  antibody. 

Few  studies  have  shown  inconsistent  results  on  whether  ER  expression  is  biased  to  any 
gender  using  different  types  of  specimens  and  assays  (6-9).  Schwartz  et  al  (9),  using  a  different 
antibody  than  ours,  reported  that  NSCLCs  obtained  from  females  were  46%  less  likely  to  have 
ER|3-positive  tumors  than  males  in  a  multivariate  analysis.  In  addition,  mRNA  expression  of 
ERa  has  been  reported  to  be  significantly  higher  in  lung  tumors  from  women  than  from  men  (8). 
In  a  small  number  of  NSCLC  tumor  tissue  specimens,  ERa  and  [)  gene  transcripts  have  been 
found  to  be  expressed  in  similar  levels  when  comparing  samples  obtained  from  females  and 
males  (7).  Adenocarcinoma  of  the  lung,  which  shows  a  weaker  association  with  tobacco  smoking 
than  with  other  types  of  lung  cancer,  is  also  found  predominantly  in  women,  suggesting  a 
possible  role  for  female  hormones  in  the  pathogenesis  of  this  type  of  lung  cancer  (5). 

In  previous  studies,  ER(3  expression  in  NSCLC  tumors  has  been  associated  with 
improved  survival  (9-11),  whereas  the  immunohistochemical  expression  of  ERa  has  been  shown 
to  be  a  poor  prognostic  factor  (9).  Thus,  both  ERs  have  been  proposed  to  play  opposite  roles  in 
cell  proliferation,  with  ERa  promoting  proliferation  and  ER(3  having  an  anti-proliferative  effect 
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(36,  37).  In  our  study,  we  did  not  find  a  correlation  between  OS  and  RFS  and  ER(3  expression, 
but  we  did  find  that  only  the  expression  of  cytoplasmic  ERa  (using  one  antibody)  conferred  to 
patients  a  significantly  worse  RFS,  but  not  OS,  in  multivariate  analysis. 

Several  studies  have  shown  that  estrogen  signaling  plays  a  role  in  the  development  of  the 
epithelium  in  the  lung  and  that  estrogen  could  potentially  promote  lung  cancer  (6,  7,  13,  16,  17). 
Additionally,  anti-estrogen  drugs  have  been  suggested  to  have  a  role  in  the  therapy  of  lung 
cancer  (6,  19).  NSCLC  cell  lines  and  in  vivo  tumor  xenographts  have  been  shown  to  respond  to 
estrogens,  and  tumor  growth  can  be  inhibited  up  to  40%  by  the  anti-estrogen  fulvestrant  (6).  In 
the  past  few  years,  significant  advances  have  been  made  in  the  development  of  new  molecularly 
targeted  agents  for  lung  cancer  (38).  The  identification  of  the  subset  of  patients  with  NSCLC 
who  will  benefit  with  targeted  therapy  is  a  key  element  in  the  development  of  personalized 
treatment  approaches  in  this  disease.  A  pilot  study  of  combined  therapy  using  fulvestrant  and 
gefitinib  in  advanced  NSCLC  has  shown  to  be  well  tolerated  and  has  demonstrated  some  tumor 
responses  (39).  Our  study  results  strongly  suggest  that  NSCLC  tumors  obtained  from  patients 
with  adenocarcinoma  histology,  female  gender,  and  history  of  never  smoking  have  a  higher 
chance  of  expressing  ERs  and  have  the  potential  to  respond  positively  to  anti-estrogen  therapy. 

To  the  best  of  our  knowledge,  our  study  is  the  first  to  report  an  association  between 
EGFR  mutation  and  ERa  and  (3  expression  in  lung  adenocarcinomas.  Importantly,  we  have 
demonstrated  that  the  correlation  between  ER  expression  and  EGFR  mutation  is  independent  of 
the  clinicopathological  features  associated  with  both  abnormalities,  such  as  adenocarcinoma 
histology,  female  gender,  and  history  of  never  smoking  (40).  Based  on  the  interactions  between 
ER  and  EGFR-signaling  pathways,  there  is  evidence  showing  that  targeting  both  pathways  by 
using  anti-estrogens  (fulvestrant)  and  EGFR  tyrosine  kinase  inhibitors  (gefitinib),  the  antitumor 
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effect  in  in  vitro  and  in  vivo  lung  cancer  models  of  the  drug  combination  is  higher  than  in 
treatment  with  each  drug  alone  (19).  Thus,  our  findings  of  an  association  between  the  activation 
of  both  pathways  further  strengthens  the  concept  of  combined  anti-estrogen  and  EGFR  inhibitor 
therapy  for  a  selected  group  of  patients  with  lung  adenocarcinoma. 

Although  PR  expression  has  been  reported  to  be  present  in  NSCLC  cell  lines  and  tumor 
specimens,  the  data  are  controversial  like  those  for  ERs  (11,  12,  15,41,  42).  Out  of  four  studies 
reporting  on  immunohistochemical  expression  of  PR  in  surgical  resected  and  formalin- fixed 
NSCLC  tissue  specimens  using  different  antibodies,  there  were  two  studies  that  reported  a 
relatively  high  frequency  of  PR  expression  in  tumors  (39%  and  47%)  (12,  15);  the  remaining  two 
reports  showed  no  expression  (11,  42).  In  the  present  study,  PR  was  frequently  (63%)  detected 
in  the  nuclei  of  malignant  NSCLC,  with  a  trend  to  higher  expression  in  squamous  cell  carcinoma 
histology.  We  did  not  find  a  correlation  between  PR  and  any  of  the  clinicopathologic 
characteristics  we  studied,  including  survival.  In  contrast,  Ishibashi  et  al  reported  that  PR 
immunohistochemical  expression  was  higher  in  NSCLCs  obtained  from  females  and  correlated 
with  better  OS  in  stages  I— III  tumors  (12).  In  breast  cancer,  transcription  of  the  PR  gene  is  well 
known  to  be  regulated  by  estrogenic  actions  through  estrogen  receptors,  and  a  positive  PR  status 
is  generally  regarded  as  one  of  the  markers  of  functional  estrogenic  pathways.  In  our  study,  we 
found  no  statistical  correlation  between  PR  and  any  of  the  ER  antibodies  studied.  In  vitro  and  in 
vivo  studies  have  shown  that  administration  of  progesterone  inhibits  the  growth  of  PR-positive 
NSCLC  cell  lines,  which  is  similar  to  what  has  been  shown  to  happen  in  breast  and  endometrial 
carcinomas  (12). 

Lung  cancer  is  believed  to  develop  from  a  series  of  preneoplastic  lesions  in  the 
respiratory  mucosa,  and  these  abnormalities  are  frequently  extensive  and  multifocal  throughout 
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the  respiratory  epithelium,  indicating  a  field-effect  or  field-cancerization  phenomenon  (43).  Our 
findings  of  relatively  frequent  expression  of  nuclear  PR  and  lack  of  expression  of  ERa  in  the 
normal  epithelium  adjacent  to  adenocarcinomas  expressing  these  receptors  suggest  that  PR,  but 
not  ERa  expression,  may  represent  a  field-effect  phenomenon.  Of  interest,  all  but  one  case  with 
normal  epithelium  expression  of  PR  showed  expression  of  this  receptor  in  the  corresponding 
tumor.  The  frequent  finding  of  cytoplasmic  ER(3  in  normal  epithelium  may  represent  a 
constitutive  expression  in  normal  respiratory  cells  and  is  probably  not  related  to  the 
carcinogenesis  process  (35). 

In  summary,  our  findings  show  that  ERa  and  (3  and  PR  are  frequently  expressed  in 
NSCLC,  and  ER  expression  distinguishes  a  subset  of  NSCLC  that  has  defined  clinicopathologic 
and  genetic  features.  The  correlation  between  ER  and  EGFR  mutation  in  lung  adenocarcinoma 
suggests  that  it  might  be  important  to  target  both  pathways  simultaneously  for  lung  cancer 
chemoprevention  and  therapy. 
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Figure  Legends 

Fig.  1.  Fig.  1.  Microphotographs  showing  representative  examples  of  immunohistochemical 
expression  of  estrogen  receptors  (ER)  a  (panels  A,  B,  and  C)  and  (3  (panels  D,  E  and  F)  and 
progesterone  receptor  (PR;  panels  G,  H  and  I),  in  tissue  specimens  of  non-small  cell  lung 
carcinoma  (NSCLC)  tumors  and  bronchial  epithelium  (panels  A,  D,  and  G)  adjacent  to 
adenocarcinomas.  The  two  NSCLC  histologies  are  represented:  adenocarcinoma  (panels  B,  E 
and  H),  and  squamous  cell  carcinoma  (panels  C,  F  and  I).  ER  a  and  (3  expressions  are  shown 
using  antibodies  ER-a4  and  ER-(3l,  respectively.  Examples  of  nuclear  expression  and 
cytoplasmic  expressions  are  indicated  by  red  and  blue  arrows,  respectively.  Normal  epithelia 
show  nuclear  expression  of  ER-(3  and  PR,  and  cytoplasmic  expression  of  ER-(3. 

Adenocarcinomas  show  nuclear  expression  for  all  three  markers  and  cytoplasmic  staining  for 
ER-(3.  Squamous  cell  carcinomas  show  nuclear  staining  for  PR,  and  both  nuclear  and 
cytoplasmic  for  ER-(3.  The  magnification  of  the  microphotographs  is  200x  for  normal  epithelium 
and  400x  for  tumors. 

Fig.  2.  Kaplan-Meier  curves  showing  recurrence  free  survival  (RFS)  of  NSCLC  patients  for  ERa 
cytoplasmic  (panel  A)  and  ER(3  nuclear  (panel  B)  expression. 

Fig.  3.  Representative  examples  of  ERa  and  (3  immunohistochemical  expression  ( upper  figures ) 
and  epidermal  growth  factor  receptor  (EGFR)  mutations  ( lower  figures)  in  lung 
adenocarcinomas.  Panel  A :  ERa  (antibody  ERa-4)  positive  in  the  nucleus  of  malignant  cells  and 
sequencing  chromatograms  showing  the  presence  of  mutant  form  of  EGFR  (15  bp  deletion  in 
exon  19;  arrow  indicates  in- frame  deletion  mutation  sequence).  Panel  B :  ER(3  (antibody  ER(3-1) 
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positive  in  the  nucleus  of  malignant  cells  and  sequencing  chromatograms  showing  the  presence 
of  mutant  EGFR  (L858R  point  mutation  in  exon  21;  arrow  indicates  CTG  to  CGG  mutation). 
Panel  C:  ER  a  and  (3  (same  antibodies  than  panels  A  and  B )  with  negative  expression  in  the 
malignant  cells  and  sequencing  chromatograms  showing  the  presence  of  wild-type  form  for 
EGFR  exon  19  (line  indicates  sequence  746  to  750). 
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Table  1.  Frequency  of  ER  and  PR  immunohistochemical  expression*  in  NSCLC  tissue 


specimens 


Marker 

Location 

Adenocarcinoma 

Squamous  cell 

carcinoma 

P  value 

Number 

of  cases 

Positive 

N  (%) 

Number 

of  cases 

Positive 

N  (%) 

ERa-1 

Nucleus 

187 

20(11) 

109 

2(2) 

0.0048 

Cytoplasm 

187 

0 

108 

0 

__t 

ERa-2 

Nucleus 

186 

84  (45) 

110 

23  (21) 

<0.0001 

Cytoplasm 

185 

1(<1) 

111 

0 

1.000 

ERa-3 

Nucleus 

191 

16(8) 

114 

0 

0.0007 

Cytoplasm 

190 

92  (48) 

114 

37  (33) 

0.0064 

ERa-4 

Nucleus 

185 

74  (40) 

109 

25  (23) 

0.0028 

Cytoplasm 

185 

35(19) 

109 

18(17) 

0.6043 

ERp-1 

Nucleus 

189 

102  (54) 

112 

66  (59) 

0.4022 

Cytoplasm 

189 

185  (98) 

112 

110(98) 

1.0000 

ER(3-2 

Nucleus 

174 

83  (48) 

100 

31  (31) 

0.0069 

Cytoplasm 

172 

37  (22) 

100 

16(16) 

0.2685 

PR 

Nucleus 

177 

103  (58) 

112 

78  (70) 

0.05 

Cytoplasm 

176 

0 

112 

0 

Any  expression  score  >0  is  considered  positive. 
’'"Not  tested. 


Abbreviations:  ER,  estrogen  receptor;  PR,  progesterone  receptor;  NSCLC,  non-small  cell 
lung  cancer;  ERot,  estrogen  receptor-a;  ER(3,  estrogen  receptor-(3. 


Table  2.  Significant  correlations  between  immunohistochemical  expression  of  ER  and 
PR*  andNSCLC  patients’  clinicopathological  features. 


Histology 

Gender 

Tobacco  History 

ADCA  (n  =  201) 

Female  (n  =  167) 

Never  (n  =  54) 

> 

> 

> 

Estrogen  Receptor 

SCC  (n  =  116) 

Male  (n  =  150) 

Ever  (n  =  262) 

ERa-1  nucleus 

0.0048 

0.0051 

f 

ns1 

ERa-2  nucleus 

<0.0001 

0.0109 

0.0006 

ERa-3  nucleus 

0.0015 

ns 

0.0242 

ERa-4  nucleus 

0.0004 

0.0148 

0.0044 

ERp-1  nucleus 

ns 

ns 

0.0290 

ERp-2  nucleus 

0.0016 

0.044 

ns 

*ER  and  PR  were  tested  using  expression  score. 

+ns  =  not  statistically  significant. 

Abbreviations:  ER,  estrogen  receptor;  PR,  progesterone  receptor;  NSCLC,  non-small  cell 
lung  cancer;  ADCA,  adenocarcinoma  SCC,  squamous  cell  carcinoma;  ERa,  estrogen 
receptor-a;  ERp,  estrogen  receptor-p. 


Table  3.  Multivariate  RFS  analysis  using  Cox  regression  model  in  NSCLC  patients* 


Variable 

Hazard  ratio 

95%  Cl  of  HR 

P  value 

Lower  limit 

Upper  limit 

ER  as  continuous  variable 

ERa-4  cytoplasm 

1.05 

1.01 

1.08 

0.0068 

ER|f- 1  nucleus 

1.01 

1.00 

1.02 

0.0034 

Stage  II  vs.  I 

1.90 

1.14 

3.18 

0.0145 

Stage  III/IV  vs.  I 

3.17 

1.98 

5.08 

<.0001 

ER  dichotomized 

ERa-4  cytoplasm:  >0  vs.  0 

1.77 

1.11 

2.81 

0.0156 

ERp-1  nucleus:  >0  vs.  0 

1.36 

0.91 

2.05 

0.1388 

Stage  II  vs.  I 

1.79 

1.08 

2.99 

0.0250 

Stage  III/IV  vs.  I 

3.13 

1.97 

4.99 

<.0001 

*With  only  significant  covariates. 

Abbreviations:  NSCLC,  non-small  cell  lung  cancer;  Cl,  confidence  interval;  HR,  hazard 
ratio;  ER,  estrogen  receptor;  ERa,  estrogen  receptor-a;  ER(3,  estrogen  receptor-(3. 


Table  4.  Significant  correlations  between  immunohistochemical  expression  of  ER  and 
EGFR  mutation  status  in  adenocarcinoma 


EGFR  mutation  status 

ER  expression  by  Wild-type  Mutant 

antibody  N  positive/total  (%)  N  positive/total  (%)  P  value 


ERa-3  nucleus 

9/146  (6) 

7/28  (25) 

0.0016 

ERa-3  cytoplasm 

68/146  (47) 

21/27  (78) 

0.0029 

ERa-4  nucleus 

50/143  (35) 

18/27  (67) 

0.0020 

ERa-4  cytoplasm 

20/143  (14) 

13/27  (48) 

<0.0001 

ERp-1  nucleus 

70/145  (48) 

22/27  (82) 

0.00015 

*ER  and  PR  were  tested  using  expression  score. 

Abbreviations:  ER,  estrogen  receptor;  EGFR,  epidermal  growth  factor  receptor;  PR, 
progesterone  receptor;  ERa,  estrogen  receptor-a;  ER(3,  estrogen  receptor- (3. 
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Abstract 

Loss  of  genome-wide  methylation  is  a  common  feature  of  cancer,  and  the  degree  of 
hypomethylation  has  been  correlated  with  genomic  instability.  Global  methylation  of  repetitive 
elements  possibly  arose  as  a  defense  mechanism  against  parasitic  DNA  elements,  including 
retrotransposons  and  viral  pathogens.  Given  the  alterations  of  global  methylation  in  both  viral 
infection  and  cancer,  we  examined  genome-wide  methylation  levels  in  head  and  neck 
squamous  cell  carcinoma  (HNSCC),  a  cancer  causally  associated  with  human  papilloma  virus 
(HPV).  We  assayed  global  hypomethylation  levels  in  26  HNSCC  samples,  compared  with  their 
matched  normal  adjacent  tissue,  using  Pyrosequencing-based  methylation  assays  for  LINE 
repeats.  In  addition,  we  examined  cell  lines  derived  from  a  variety  of  solid  tumors  for  LINE  and 
SINE  (Alu)  repeats.  The  degree  of  LINE  and  Alu  hypomethylation  varied  among  different 
cancer  cell  lines.  There  was  only  moderate  correlation  between  LINE  and  Alu  methylation 
levels,  with  the  range  of  variation  in  methylation  levels  being  greater  for  the  LINE  elements. 
LINE  hypomethylation  was  more  pronounced  in  HPV-negative  than  in  HPV-positive  tumors. 
Moreover,  genomic  instability,  as  measured  by  genome-wide  loss-of-heterozygosity  (LOH)  SNP 
analysis,  was  greater  in  HNSCC  samples  with  more  pronounced  LINE  hypomethylation.  Global 
hypomethylation  was  variable  in  HNSCC.  Its  correlation  with  both  HPV  status  and  degree  of 
LOH  as  a  surrogate  for  genomic  instability  may  reflect  alternative  oncogenic  pathways  in  HPV- 
positive  versus  HPV-negative  tumors. 
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Introduction 

DNA  methylation  is  an  epigenetic  DNA  modification  that  occurs  via  the  action  of  DNA 
methyltransferases  on  CpG  dinucleotides.  Methylated  regions  of  DNA  are  associated  with 
chromatin  remodeling,  generally  occurring  in  areas  of  more  condensed  chromatin  and 
decreased  transcriptional  activity  [1 ,2],  Renewed  interest  in  this  process  arose  after  it  was 
recognized  that  two  types  of  aberrant  methylation  patterns  are  present  in  cancer  cells  [1 ,3].  The 
first  is  gene-specific  hyper-methylation,  where  CpG  islands  in  the  promoter  regions  of  genes 
acquire  increased  methylation,  generally  leading  to  reduced  expression  of  the  downstream 
gene.  The  second  is  genome-wide  hypo-methylation,  a  large  percentage  of  which  occurs  in 
repetitive  DNA  elements.  In  malignancy,  global  methylation  is  often  aberrantly  reduced, 
whereas  gene-specific  methylation  is  often  aberrantly  increased.  While  the  effects  of  gene- 
specific  hypermethylation  (e.g.,  reduced  expression  of  a  gene  that  is  important  for  growth 
control)  are  easily  appreciated,  the  effects  of  reduced  global  methylation  are  more  vague  [1,3]. 

It  has  been  hypothesized  that  DNA  methylation  initially  evolved  as  a  defense  mechanism 
against  viral  and  other  DNA  pathogens  as  a  way  to  silence  foreign  DNA  sequences  [4-6],  This 
is  consistent  with  the  observation  that  LINE  and  SINE  ( Alu )  elements,  originating  from 
transposable  elements,  are  heavily  methylated  in  normal  cells.  Methylation  of  the  HPV  viral 
genome  upon  integration  into  the  host  genome  has  been  reported,  and  changes  in  methylation 
of  HPV  DNA  have  been  associated  with  tumorigenesis  [7,8]. 

Global  methylation  is  also  clinically  relevant,  as  demonstrated  by  associations  between 
clinical  outcome  and  global  methylation  levels  in  a  number  of  cancer  types  [9-11].  From  a 
mechanistic  standpoint,  global  methylation  appears  to  be  related  to  cancer  progression,  since 
loss  of  global  methylation  tends  to  become  more  pronounced  as  precancerous  lesions  progress 
[12,13].  Furthermore,  in  colon  cancer  cells,  loss  of  LINE  methylation  is  inversely  correlated  with 
microsatellite  instability  and  is  directly  correlated  with  chromosome  instability  [14,15], 
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We  hypothesized  that  global  methylation  levels  in  cancer,  represented  by  levels  of  LINE 
and  SINE  (Alu)  methylation,  might  be  correlated  with  viral  infection.  We,  therefore,  examined 
LINE  methylation  in  relation  to  HPV  status  in  head  and  neck  cancers.  In  contrast  to  cervical 
cancers,  which  are  nearly  all  associated  with  HPV  infection,  HNSCC  is  virally  mediated  in  only  a 
subset  of  cases  (25-30%)  [16].  To  determine  the  impact  of  viral  infection  on  global  methylation 
levels  in  HNSCC,  we  developed  Pyrosequencing-based  methylation  assays  for  repetitive  DNA 
elements  and  compared  HPV-positive  with  HPV-negative  cancers.  Previous  publications  have 
suggested  that  LINE  methylation  is  variable  in  HNSCC,  but  did  not  find  an  association  between 
HPV  DNA  status  and  LINE  methylation  levels  [21 ,23],  Here  we  show  that  global 
hypomethylation  is  variable  in  HNSCC  and  correlates  with  both  HPV  status  and  genomic 
instability. 

Results 

LINE/SINE  (Alu)  assays  are  precise  and  reproducible,  but  show  only  moderate  correlation 
between  LINE-1  and  Alu  methylation  levels 

To  assay  global  methylation  levels,  we  adapted  our  Pyrosequencing-based  Methylation 
Analysis  (PMA)  assay  [17]  to  assess  methylation  of  repetitive  LINE  and  SINE  (Alu)  elements, 
similar  to  genome-wide  methylation  assays  reported  previously  [18].  To  validate  the  assays,  we 
performed  a  series  of  tests:  (1)  mixing  experiments  with  known  amounts  of 
methylated/unmethylated  DNA;  (2)  methylation  studies  on  a  variety  of  cancer  cell  lines  to 
compare  LINE  to  SINE  methylation  and  to  survey  global  methylation  across  a  broad  range  of 
malignancies;  and  (3)  methylation  of  samples  from  different  ages  and  genders  to  eliminate 
these  factors  as  possible  confounders  of  global  methylation  measurements. 

Stepwise  increments  of  methylated  DNA  were  prepared  by  mixing  universally 
unmethylated  (U2M.L)  DNA  with  universally  methylated  (UM.L)  DNA  in  various  proportions. 
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The  mixed  samples  were  then  bisulfite  treated  and  subjected  to  our  PMA  LINE-1  (LINE)  and  Alu 
(SINE)  assays.  Linear  regression  analysis  showed  that  LINE-1  and  Alu  methylation  levels  were 
very  closely  associated  with  levels  predicted  by  input  fraction  of  methylated  DNA  (r  =  0.995,  p- 
value  <0.0005  for  LINE-1 ,  and  r  =  0.980,  p-value<0.0005  for  Alu\  Figure  SI ).  It  should  be  noted 
that  even  when  input  DNA  was  completely  methylated,  the  maximum  global  methylation 
percentage  as  measured  by  the  PMA  assay  is  just  over  50%,  not  100%.  This  is  because 
individual  repetitive  elements  have  diverged  in  sequence  over  time;  and  CpG  dinucleotides,  if 
mutated  to  TpG  dinucleotides,  are  indistinguishable  from  unmethylated  CpGs  after  the  bisulfite 
treatment.  This  level  of  background  noise  is  taken  into  account  by  normalizing  each  result  to 
the  universally  methylated  control  (i.e.,  reporting  the  percent  methylated  reference,  or  PMR). 

Four  pools  of  normal  DNA  samples  (from  peripheral  blood  leukocytes)  were  generated 
to  assess  the  influence  of  age  and  gender  on  LINE-1  and  Alu  methylation  levels.  Each  pool 
(females  <40  years  old,  males  <40  years  old,  females  >40  years  old,  males  >40  years  old) 
contained  DNA  from  at  least  five  individuals.  There  were  no  sex-  or  age-dependent  differences 
between  the  pools  in  LINE-1  or  Alu  methylation  levels  (data  not  shown). 

Each  LINE-1  and  Alu  methylation  assay  was  tested  on  a  panel  of  23  cancer  cell  lines. 
Different  CpG  sites  were  compared  by  means  of  three  distinct  LINE-1  assays  and  three  distinct 
Alu  assays,  each  derived  from  different  regions  of  the  LINE-1  and  Alu  consensus  sequences, 
respectively  (Table  1).  As  a  control,  we  also  tested  seven  normal  lymphoblastoid  cell  lines, 
which  showed  normal  levels  of  methylation  (all  >80%  in  our  LINE-1  assays).  In  contrast,  as 
expected  from  previous  reports  [12-14,19],  many  of  the  tumor  cell  lines  showed  global 
hypomethylation,  which  was  most  pronounced  in  the  LINE-1  assays  (Figure  1).  To  check  the 
consistency  between  LINE  and  SINE  methylation  levels  (represented  by  our  LINE-1  and  Alu 
assays,  respectively),  we  compared  the  degree  of  correlation  between  results  from  the  three 
LINE-1  assays,  between  results  from  the  three  Alu  assays,  and  between  the  various  LINE-1  and 
Alu  assays.  Linear  regression  analysis  showed  that  the  individual  LINE-1  assay  results  were 
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highly  correlated  with  each  other,  for  example  r  =  0.93  for  the  correlation  between  the  LI -2  and 
LI -3  assay  results  (Figure  S2A).  The  same  high  degree  of  correlation  was  present  between 
results  from  the  Alu  assays,  for  example  r  =  0.82  for  the  correlation  between  the  Alu-1  and  Alu-3 
assay  results  (Figure  S2B).  However,  LINE-1  assay  results  were  only  moderately  correlated 
with  Alu  assay  results,  for  example  r=  0.33  comparing  LI-1  and  Alu-1;  Figure  S2C).  A  similar 
level  of  moderate  correlation  between  LINE-1  and  Alu  assays  was  reported  previously  in 
neuroendocrine  tumors  [20]. 

LINE-1  hypomethylation  of  HNSCC  patient  samples 

Matching  normal,  primary  tumor,  and  where  available,  lymph  node  metastases  of  head 
and  neck  patient  samples  (Table  SI)  were  tested  using  our  PMA  LINE-1  assays  (Figure  2). 
Because  of  the  greater  dynamic  range  of  the  LINE-1  assay  and  limited  sample  amounts,  only 
LINE-1  assays  were  performed  for  the  remainder  of  this  study.  In  general,  HNSCC  primary 
tumors  and  metastatic  lymph  nodes  were  hypomethylated  compared  to  their  matching  normal 
adjacent  tissues.  With  the  LINE1-1  assay,  the  mean  primary  tumor  PMR  was  65.5%,  while  the 
mean  normal  PMR  was  90.0%  (p-value  =  6.7x1  O'8  using  a  paired  f- test).  However,  there  was 
quite  a  bit  of  variability  in  the  primary  tumors,  ranging  from  31.2%  (severe  hypomethylation)  to 
90.8%  (normal).  The  mean  PMR  in  lymph  node  metastases  (73.8%;  range  31.8%-93.8%)  was 
also  highly  variable,  even  with  respect  to  the  corresponding  primary  tumor,  sometimes  lower 
and  sometimes  higher  than  the  PMR  of  the  primary  tumor  with  which  it  is  associated. 

HPV-positive  tumors  retain  more  LINE-1  methylation  than  HPV-negative  tumors 

Because  there  was  so  much  variability  in  the  HNSCC  tumor  PMR,  we  hypothesized  that 
the  level  of  global  hypomethylation  might  vary  in  different  subgroups  of  HNSCC.  To  explore  the 
relationship  between  loss  of  LINE-1  hypermethylation  and  HPV  status,  we  compared  the  mean 
LINE-1  methylation  level  of  three  groups  that  differed  by  their  HPV  status.  The  first  group  (n  = 
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8)  was  HPV  negative  (-/-);  the  second  group  (n  =  8)  contained  HPV  DNA,  yet  were 
transcriptionally  silent  with  respect  to  the  E6  viral  oncogene,  indicating  lack  of  expression  of  this 
oncoprotein  (+/-).  The  third  group  of  tumors  (n  =  8)  was  positive  for  HPV  DNA  and  E6 
expression  (+/+).  The  mean  methylation  level  of  the  three  groups  of  tumors  is  statistically 
different,  with  a  p-value  =  0.01 1  for  the  trend  (Figure  3). 

We  also  compared  global  methylation  levels  in  matched  normal  adjacent  tissues  from 
the  same  three  groups.  In  contrast  to  the  results  for  the  primary  tumors,  there  were  no 
differences  in  global  methylation  levels  in  the  matching  normal  tissues  and  therefore  no 
correlation  with  HPV  status  (Figure  3).  Based  on  a  previously  reported  association  between 
LINE-1  methylation  and  T-N-M  stage  [21],  we  looked  for  such  an  association  in  our  sample  set, 
but  found  none  (p-value  =  0.31). 

Levels  of  LINE-1  hypomethylation  and  LOH  in  HNSCC  tumors  are  correlated 

Colon  cancer  cell  lines  with  more  pronounced  LINE  hypomethylation  were  previously 
reported  to  have  a  higher  degree  of  LOH  [14,15].  To  examine  whether  this  correlation  held  true 
in  our  HNSCC  patient  samples,  we  performed  a  global  LOH  analysis  using  10K  SNPChip  data. 
Genotypes  were  compared  between  normal  adjacent  tissue  and  matched  tumor  samples; 
informative  loci  were  those  that  were  heterozygous  in  the  normal  tissue.  Figure  4  shows  a  plot 
of  the  percentage  of  informative  loci  for  each  primary  tumor  with  LOH  versus  the  degree  of 
LINE-1  methylation  in  the  same  specimen.  We  fit  a  linear  trend  to  the  data,  which  showed  a 
Pearson  correlation  of  -0.494,  with  a  p-value  =  0.017.  As  a  check  of  robustness,  we  also 
calculated  the  Spearman  (rank-based)  correlation,  which  was  also  significant.  The  Spearman 
correlation  coefficient  for  this  relationship  was  -0.428  (p-value  =  0.042),  establishing  that  LOH 
was  indeed  significantly  correlated  with  the  degree  of  LINE  hypomethylation. 
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Discussion 

We  developed  several  LINE  (LINE-1)  and  SINE  (Alu)  methylation  assays  with  PCR 
primers  in  conserved  regions  of  these  repetitive  elements.  These  assays  utilize  multiple  CpG 
sites  (3-6)  to  determine  methylation  levels  and  enable  the  simultaneous  amplification  of  many 
individual  LINE  and  SINE  elements  throughout  the  human  genome  as  representative  genomic 
landmarks  for  global  methylation  analysis.  As  expected,  results  from  individual  LINE-1  assays 
were  very  well  correlated  with  results  from  other  LINE-1  assays,  despite  measuring  methylation 
in  different  regions  of  the  LINE-1  sequence.  The  same  is  true  for  Alu  assays,  with  high 
correlation  between  results  from  three  different  assays.  However,  when  LINE-1  and  Alu 
methylation  levels  were  compared  with  each  other,  the  correlation  was  more  modest,  only  about 
40%.  The  reasons  for  this  lower  correlation  may  be  related  simply  to  differences  in  assay 
sensitivity:  LINE-1  assays  ranged  from  0-50%  methylated  in  mixing  studies,  while  Alu  assays 
had  less  amplitude,  ranging  from  0-30%  methylated,  possibly  secondary  to  higher  inter¬ 
individual  background  noise  due  to  sequence  variation  between  individual  Alu  elements  (Figure 
SI).  Another  possibility  is  that  there  is  a  functional  or  biological  difference  between  the  two 
types  of  repetitive  DNA  in  their  methylation  maintenance.  LINE  repeats  are  more  frequent  in 
gene-poor  regions  of  the  genome,  while  Alu  elements  are  more  common  in  gene-rich  regions 
[22],  Since  global  methylation  can  be  measured  by  a  variety  of  methods,  these  inter-assay 
differences  should  be  considered  when  comparing  results  using  different  types  of  assays  and 
studies. 

Our  results  show  decreases  in  global  methylation  in  cell  lines  from  a  variety  of  cancer 
cell  types  (Figure  1 ),  similar  to  previously  published  results  for  a  variety  of  tumor  types  [12- 
14,19].  It  is  worth  noting  that  the  cancer  cell  lines  in  general  had  lower  levels  of  methylation 
than  the  HNSCC  samples,  perhaps  reflecting  a  longer  time  to  accumulate  methylation  loss  or 
the  clonal  nature  of  these  cell  lines.  However,  some  cell  lines  (e.g.,  RKO)  had  little  if  any  loss  of 
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methylation.  This  implies,  as  for  the  primary  HNSCC  tumors,  that  there  is  variability  in  the 
underlying  biology.  Further  research  into  what  causes  this  variability  could  provide  important 
information  about  pathways  of  and  the  role  of  epigenetic  alterations  in  cancer  progression. 

Our  results  demonstrate  a  positive  correlation  between  maintenance  of  normal  LINE 
methylation  and  HPV-positivity.  In  addition,  maintenance  of  normal  LINE  methylation  was  also 
correlated  with  less  LOH  or  genome  instability.  If  LOH  is  viewed  as  a  surrogate  for 
chromosome  instability  (CIN),  this  result  is  consistent  with  the  previously  reported  result  that 
colon  cancers  also  have  an  association  between  CIN  and  loss  of  LINE  hypermethylation 
[14,15],  Thus,  it  is  tempting  to  speculate  that  LINE  methylation  and  CIN  are  causally  related, 
perhaps  via  methylation’s  known  association  with  more  densely  chromatin  packaging,  which 
may  translate  into  more  fidelity  during  chromosome  segregation  and  therefore  less  LOH. 
However,  our  results  and  previously  reported  results  are  purely  correlative;  functional  studies 
will  be  necessary  before  this  causal  relationship  can  be  established. 

The  novel  finding  in  our  results  is  that  HPV-positivity  is  correlated  with  maintenance  of 
LINE  methylation.  A  recent  epidemiologic  study  identifying  risk  factors  for  LINE-1 
hypomethylation  reported  no  significant  association  between  HPV  DNA  status  and  LINE-1 
methylation  levels  [23].  However,  there  were  methodological  differences  between  that  and  our 
study,  including  the  use  of  both  HPV  E6  DNA  and  RNA  status  in  the  assignment  of  HPV  status, 
the  use  of  a  different  methylation  assay,  and  the  use  of  methylation  level  as  a  continuous 
variable  in  the  analysis.  Although  Furniss  and  colleagues  did  not  show  a  direct  association, 
they  showed  that  outcomes  varied  by  LINE-1  methylation  levels  for  HPV-negative  tumors  [23], 
Further  studies  are  needed  to  clarify  the  association  between  HPV  status  and  LINE  methylation. 
One  possible  hypothesis  is  that  in  an  attempt  to  silence  the  HPV  virus,  infected  cells  induce  a 
more  exuberant  methylation  response,  harkening  back  to  the  origins  of  methylation  as  a  viral 
defense  mechanism.  Although  this  will  again  require  mechanistic  studies  or  perhaps  studies  in 
additional  types  of  virally  mediated  cancers  (e.g.,  Hepatitis  B  and  C  mediated  HOC  compared 
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with  non-virally  induced  HCC,  or  EBV+  lymphoma  vs.  EBV-  lymphoma)  to  establish  causality, 
our  results  along  with  those  of  Furniss  and  colleagues  [23]  certainly  provide  more  support  for 
the  hypothesis  that  HPV-positive  HNSCC  and  HPV-negative  HNSCC  represent  distinct  biologic 
entities  that  arise  via  separate  oncogenic  pathways. 

In  summary,  we  developed  several  new  LINE  (LINE-1)  and  SINE  (Alu)  whole-genome 
methylation  assays,  which  we  applied  to  determine  the  methylation  status  of  a  variety  of  cancer 
cell  lines  and  HNSCC  primary  tumor  samples.  We  find  that  cancer  cell  lines  in  general  have 
decreased  methylation  levels  of  repetitive  elements.  Even  more  variability  in  LINE-1 
methylation  was  noted  within  HNSCC  samples.  Importantly,  we  discovered  a  correlation 
between  HPV-negativity,  increased  genome  instability,  and  loss  of  genome  methylation.  This 
correlation  reinforces  the  concept  that  HPV-positive  and  HPV-negative  cancers  are  biologically 
distinct  and  provides  a  basis  for  future  studies  to  further  define  the  biologic  mechanism 
underlying  these  findings. 

Materials  and  Methods 

Patient  samples  and  cell  lines 

Matched  tumor/normal  adjacent  tissue  samples,  and  when  available  cervical  lymph  node 
metastases  from  HNSCC  patients  were  collected  at  Ohio  State  University  as  described 
previously  (Table  SI )  [24].  Genomic  DNA  was  extracted  from  the  DNA-protein  phase  of  TriZol- 
extracted  tissues  according  to  the  manufacturer’s  suggestions  (Invitrogen).  DNA  was  extracted 
using  the  PureGene  kit  (Gentra)  on  cell  pellets  from  four  HNSCC  cell  lines  (SCC-4,  SCC-9, 
SCC-15  and  SCC-25),  five  lung  cancer  cell  lines  (HI 395,  H520,  H2170,  SK-MES-1  and  SW- 
900),  one  breast  cancer  cell  line  (MCF7),  one  cervical  cancer  cell  line  (HeLa),  three  brain 
cancer  cell  lines  (U251,  SK-N-AS  and  M059K),  one  uterine  cancer  cell  line  (AN3CA),  one 
sarcoma  cell  line  (HT1080),  one  kidney  cancer  cell  line  (HEK293),  and  six  colon  cancer  cell 
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lines  (LoVo,  SW48,  HCT-15,  DLD-1,  COLO  320DM  and  RKO)  according  to  the  manufacturer’s 
suggestions.  In  addition,  DNA  from  the  lymphoblastoid  cell  line  BL1395  was  used  as  a 
matching  control  to  HI 395.  All  cell  lines  are  available  from  ATCC  (Manassas,  VA). 

Normal  pools  and  methylated  controls 

Four  pools  of  normal  samples  were  generated  representing  different  genders  and  ages. 
DNA  samples  were  obtained  from  anonymous  blood  donors  and  were  a  gift  of  Dr.  Michael  J. 
Siciliano  (University  of  Texas  M.  D.  Anderson  Cancer  Center).  Three  of  the  pools  (females 
older  than  40  years  of  age,  females  age  40  or  under,  and  males  age  40  or  under)  were  each 
comprised  of  five  individuals  per  pool.  The  fourth  pool  (males  older  than  40  years  of  age)  was 
comprised  of  six  individuals.  Commercially  prepared  universally  methylated  and  universally 
unmethylated  DNA  (UM.C  and  U2M.C,  respectively)  were  obtained  from  Chemicon.  UM.L 
(universally  methylated  DNA  )  was  generated  as  a  positive  control  by  treating  normal  peripheral 
blood  leukocyte  (PBL)  DNA  with  the  CpG  methylase  M.Sss\  (New  England  Biolabs)  [25], 

U2M.L  (universally  unmethylated  DNA)  was  generated  as  a  negative  control  by  amplifying  the 
same  DNA  used  to  generate  the  positive  control  DNA  using  the  GenomiPhi  kit  (GE  Healthcare) 
as  described  by  the  manufacturer. 

Primers  and  PCR  conditions 

PCR  primers  and  sequencing  primers  were  designed  by  using  PSQ  Assay  Design 
software  (Biotage)  [26],  Three  assays  for  SINE  ( Alu )  elements  and  three  assays  for  LINE-1 
elements  were  designed  (Table  1).  PCR  was  performed  in  a  25  pi  reaction  containing  Qiagen 
HotStart  Taq  master  mix  (Qiagen)  using  1  pi  bisulfite  treated  DNA  (10  ng  of  DNA  equivalents). 
To  reduce  the  cost  per  assay,  the  amplification  protocol  was  developed  using  a  biotinylated 
universal  primer  approach.  Final  primer  concentrations  were  1 0  nM  of  the  primer  tailed  with  the 
universal  primer,  100  nM  of  the  untailed  primer,  and  90  nM  of  the  universal  biotinylated  primer  in 
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each  reaction  [17].  The  amplification  was  carried  out  at  the  following  conditions:  denaturation  at 
95°C  for  5  min,  followed  by  45  cycles  at  95°C  for  30  sec,  45°C  (SINEs)  or  53°C  (LINEs)  for  1 
min,  72°C  for  45  sec,  and  a  final  extension  at  72°C  for  7  min  [17]. 

PyroMethA  (PMA)  and  methylation  assessment 

Bisulfite  conversion  of  genomic  DNA  was  done  as  reported  previously  [17].  Briefly,  0.5- 
1 .0  pg  of  genomic  DNA  was  treated  using  the  CpGenome  DNA  modification  kit  (Chemicon), 
including  DNA  sulfonation,  deamination,  desalting,  desulfonation  and  recovery.  Bisulfite-treated 
DNA  was  stored  at  -20°C  until  use.  PMA  is  a  Pyrosequencing-based  technology  that  can 
analyze  CpG  methylation  at  multiple  sites  in  a  single  assay.  After  a  PCR  amplification  using 
bisulfite  treated  DNA,  Pyrosequencing  was  carried  out  using  the  PSQ96HS  system  (Biotage) 
according  to  the  manufacturer’s  protocol  including  single  strand  binding  protein  (PyroGold 
reagents).  The  results  were  analyzed  using  Q-CpG  software  (Biotage),  which  calculates  the 
methylation  percentage  (mC/(mC+C))  for  each  CpG  site,  allowing  quantitative  comparisons.  The 
methylation  index  (called  Ml)  was  calculated  as  the  average  value  of  mC/(mC+C)  for  all 
examined  CpG  sites  in  the  assay.  In  general,  there  was  very  good  agreement  in  methylation 
levels  among  individual  CpG  sites  in  the  same  assay.  Bisulfite-treated  UM.L  was  used  as  the 
universally  methylated  reference.  PMA  data  for  these  global  methylation  assays  are  reported 
as  a  percentage  of  methylated  reference  (PMR)  value,  normalizing  the  Ml  of  each  sample  to  the 
Ml  of  the  universally  methylated  reference  (UM.L)  DNA  [25]. 

Commercial  universally  methylated  DNA  (UM.C)  and  universally  unmethylated  DNA 
(U2M.C)  (Chemicon)  were  also  used  to  perform  this  analysis,  and  the  result  was  linear  (r  = 
0.983,  p-value  <  0.0005  for  LINE-1 ;  Figure  SI ).  However,  commercially  available  U2M  was  not 
completely  unmethylated,  since  a  pure  U2M.C  sample  had  residual  methylation  of 
approximately  26%,  different  from  U2M.L  prepared  in  our  lab,  which  had  the  expected  0% 
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methylation.  We  therefore  used  our  own  universally  unmethylated  DNA  (U2M.L)  for  all  further 
studies. 

Detection  of  HPV16  E6  DNA  and  E6  RNA  to  determine  HPV  status 

Quantitative  real-time  PCR  was  performed  to  detect  either  HPV16  E6  DNA  or  E6  cDNA 
using  the  same  primer/probe  set  for  both.  The  primers  were  designed  using  the  HPV16 
serotype,  which  accounts  for  >90%  of  all  HPV-positive  HNSCC  cases  [27],  To  control  for 
possible  genomic  DNA  contamination  in  the  cDNA,  amplifications  of  cDNA  from  DNAse-treated 
RNA  that  had  been  prepared  using  the  Superscript  First  Strand  Synthesis  System  (Invitrogen) 
both  with  and  without  the  addition  of  reverse  transcriptase  were  compared.  PCR  was 
performed  in  a  25  pi  reaction  containing  iQ  supermix  master  mix  (Biorad)  using  25  ng  of 
genomic  DNA  or  5  pi  of  cDNA.  Forward  primer  (5’-CTGCAATGTTTCAGGACCCA-3’)  and 
reverse  primer  (5’-TCATGTATAGTTGTTTGCAGCTCTGT-3’)  were  added  to  a  final 
concentration  of  200  nM  each.  The  Texas  Red  labeled  real-time  probe  (5’-TR- 
AGGAGCGACCCAGAAAGTTACCACAGTT-3’)  was  added  to  a  final  concentration  of  320  nM. 
Fluorescein  was  added  to  each  reaction  at  a  final  concentration  of  10  pM.  Each  reaction  was 
performed  in  triplicate.  The  amplification  was  carried  out  at  the  following  conditions: 
denaturation  at  95°C  for  8.5  min,  followed  by  50  cycles  at  95°C  for  15  sec,  60°C  for  1  min  and 
analyzed  in  real  time  using  an  iCycler  PCR  machine  and  software  (Biorad).  Samples  that  did 
not  amplify  were  scored  as  negative,  and  all  samples  were  grouped  into  3  categories  based  on 
these  results:  (+/+),  positive  for  both  E6  DNA  and  E6  RNA;  (+/-),  positive  for  E6  DNA,  but 
transcriptionally  silent;  and  (-/-),  negative  for  E6  DNA  and  RNA.  HPV  status  was  successfully 
determined  for  24  of  the  26  HNSCC  samples,  and  these  were  used  for  subsequent  analyses 
utilizing  HPV  status. 

10K  SNPChip  LOH  analysis 
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HNSCC  genomic  DNAs  were  extracted  using  the  standard  TriZol-extraction  protocol; 
they  were  further  purified  by  ethanol  precipitation  before  and  after  whole  genome  amplification 
using  the  GenomiPhi  kit  (GE  Healthcare).  The  Affymetrix  10K  Xba131  array  contains 
approximately  1 1 ,500  SNPs  with  an  average  spacing  of  210  kb.  Standard  Affymetrix  protocols 
were  followed  for  these  assays.  Briefly,  about  250  ng  genomic  DNA  was  digested  with  Xba\ 
and  then  ligated  to  adaptors.  Next,  one-primer  amplification  was  carried  on  by  using  the 
GeneAmp  PCR  System  9700  (Applied  Biosystems).  After  purification  with  Qiagen  MinElute  96 
UF,  a  total  of  about  20  pg  of  PCR  product  was  fragmented  and  labeled  with  biotin. 

Hybridization  was  performed  in  the  Affymetrix  GeneChip  Hybridization  Oven  at  48°C  for  16-18 
hours.  Arrays  were  washed  and  stained  with  the  Affymetrix  GeneChip  Fludics  Station  400  and 
were  scanned  with  the  Affymetrix  GeneArray  2500  Scanner.  Image  processing  was  performed 
with  GCOS  1 .0  software  and  genotypes  were  generated  with  GTYPE  2.0  or  higher  software. 

Statistical  analysis 

In  order  to  assess  the  association  between  levels  of  methylation  and  levels  of  HPV  load 
(a  similar  analysis  was  also  used  to  assess  the  association  between  levels  of  methylation  and 
T-N-M  stage),  we  proceeded  as  follows.  All  sample  methylation  values  were  converted  to 
ranks.  One  missing  value  for  the  LINE1-1  assay,  for  sample  P2,  was  imputed  using  the  P2 
value  from  the  LINE1-3  (correlation  between  the  LINE1-1  and  LINE1-3  assays  is  0.98).  These 
ranks  were  then  scaled  (weighted)  by  HPV  load,  with  weights  of  0,  1  and  2  for  -/-,  +/-,  and  +/+, 
respectively.  The  final  association  "score"  was  the  sum  of  these  weighted  ranks.  The  null 
distribution  for  this  score  was  assessed  by  simulations  in  which  we  repeatedly  allocated 
samples  to  HPV  groups  at  random.  We  ran  one  million  simulations,  and  defined  our  p-value  as 
two  times  the  proportion  of  cases  in  which  the  simulation  score  was  as  large  or  larger  than  the 
one  we  actually  saw.  Our  simulated  p-value  was  0.01 1 . 
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Supporting  Information 


Table  SI  Clinical  and  molecular  features  of  HNSCC  samples. 


Figure  SI  Dynamic  range  of  PMA  LINE-1  and  Alu  Assays. 


Figure  S2  Correlation  analysis  of  LINE  and  SINE  (Alu)  global  methylation  assays. 
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Table  1.  LINE-1  and  Alu  PMA  Assays. 


Assays 

GenBank 

Number 

[Ref.] 

F  Primer  Seq 
(5’~3’) 

R  Primer  Seq 
(5-3) 

Sequencing  Primer 
(5-3) 

Amplicon 
Size  (bp) 

No. 

CpG 

Sites 

LI-1 

X52235 

tttattagggagtgttagatagtggg 

GACGGGACACCGCTGATCGTTTA 

cttcccaaataaaacaatacc 

tgggygtaggttagtgggtg 

(F) 

117 

6 

LI -2 

X52235 

GACGGGACACCGCTGATCGTTTA 

ggtattgttttatttgggaag 

ccctcctaaccaaatacaaaat 

ccaaatacaaaatataatct 

(R) 

196 

4 

LI -3 

M80343 

attagggagtgttagatagtggg 

GACGGGACACCGCTGATCGTTTA 

ccccttacrcttcccaaat 

gygtaggttagtgtgtgtg  (F) 

123 

5 

Alu-1 

[26] 

GACGGGACACCGCTGATCGTTTA 

tttttattaaaaatataaaaaattagt 

ccaaactaaaatacaataa 

aaactaaaatacaataac 

(R) 

170 

3 

Alu-2 

[26] 

GACGGGACACCGCTGATCGTTTA 

tttttattaaaaatataaaaaattagt 

tcaacctcccraataactaaaa 

aataactaaaattacaaac 

(R) 

96 

5 

Alu-3 

J00085 

gagagaattgtttgaatttagga 

GACGGGACACCGCTGATCGTTTA 

cactatcacccaaactaaaataca 

tgaatttaggaggtgga  (F) 

102 

4 

The  sequence  for  the  biotin-labeled  universal  primer  (5’ — 3’)  for  the  3-primer  PCR  is  underlined,  Biotin- 

GGGACACCGCTGATCGTTTA.  F,  sequencing  primer  extends  in  the  forward  direction;  R,  sequencing  primer  extends  in  the  reverse 
direction,  r,  purine  (A  or  G);  y,  pyrimidine  (C  or  T). 
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Figure  Legends 

Figure  1.  Alu  and  LINE-1  methylation  in  a  variety  of  tumor  cell  lines.  A.  Alu  methylation 
using  the  Alu-3  assay.  B.  LINE-1  methylation  using  the  LI -3  assay.  Results  are  reported  as 
the  PMR  (percent  methylated  reference)  normalized  to  the  universally  methylated  reference 
DNA.  Universally  unmethylated  (U2M),  universally  methylated  (UM)  controls,  and  normal 
control  DNA  from  PBLs  are  also  shown. 

Figure  2.  LINE-1  methylation  in  HNSCC  tumor  samples.  Paired  tumor  and  normal  samples 
and,  where  available,  lymph  node  metastases  were  assayed  for  LINE-1  methylation.  PMR 
values  are  plotted  using  the  normal  sample  PMR  as  the  x-value  and  primary  tumor  (circles)  or 
metastasis  (triangle)  PMR  as  the  y-value.  Horizontal  lines  represent  PMR  values  for  universally 
methylation  (UM)  and  unmethylated  (U2M)  controls,  normal  lymphocytes  (green)  and  for  four 
HNSCC  cell  lines  (yellow).  Box  plots  on  the  right  show  the  mean  and  distribution  of  primary 
tumors  and  metastases  in  the  subset  of  samples  where  matched  metastases  and  primary 
tumors  were  available. 

Figure  3.  LINE-1  methylation  levels  are  lower  in  HPV-negative  HNSCC.  Box  plots  of 
methylation  levels  (PMR)  of  HNSCC  primary  tumors  (right  in  each  pair)  and  normal  adjacent 
tissue  (left  in  each  pair)  are  shown  according  to  HPV  status.  +/+,  HPV-positive  and  expressing 
E6  viral  mRNA;  +/-,  HPV-positive  but  transcriptionally  silent;  and  -/-,  HPV  negative. 

Figure  4.  LINE-1  hypomethylation  is  correlated  with  degree  of  LOH  in  HNSCC.  LINE-1 
methylation  is  plotted  as  PMR,  and  LOH  is  the  fraction  of  10K  SNP  loci  that  show  LOH  relative 
to  all  informative  loci.  Pearson  correlation  coefficient  is  -0.494  (p-value  =  0.01)  for  the 
correlation  between  the  two  values. 
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Figure  1 
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Table  SI.  Clinical  and  molecular  features  of  HNSCC  samples. 


Patient 

No. 

PMR-nl 

PMR-prim 

PMR-met 

HPV  Status 

#LOH 

#AB 

#LOH/#AB 

Primary  Tumor  Site 

Sex 

Age 

Tc 

Nc 

Me 

Stage 

2 

0.96 

0.91 

NA 

+/+ 

211 

1776 

0.119 

Oropharynx/tonsil 

M 

47 

4 

2 

0 

IV 

5 

0.91 

0.66 

0.93 

-/- 

21 

2532 

0.008 

Larynx/subglottis 

M 

76 

4 

2 

0 

IV 

6 

0.91 

0.41 

0.32 

NA 

NA 

NA 

NA 

Pharynx 

M 

50 

NA 

NA 

NA 

NA 

7 

0.89 

0.31 

0.46 

-/- 

557 

2175 

0.256 

Larynx/supraglottis 

M 

48 

2 

3 

0 

IV 

8 

0.95 

0.91 

0.9 

+/+ 

81 

2446 

0.033 

Oral  cavity/tongue 

M 

65 

3 

2 

0 

IV 

9 

0.95 

0.84 

0.83 

+/- 

56 

2105 

0.027 

Oral  cavity/tongue 

M 

45 

3 

1 

0 

III 

10 

0.91 

0.88 

0.9 

+/+ 

38 

2674 

0.014 

Oropharynx/tonsil 

M 

54 

1 

2 

0 

IV 

11 

0.92 

0.52 

0.94 

+/+ 

187 

2831 

0.066 

Larynx/glottis 

M 

64 

4 

1 

0 

IV 

12 

0.95 

0.79 

0.82 

+/+ 

39 

2582 

0.015 

Oropharynx/BOT 

M 

53 

3 

1 

0 

III 

13 

0.92 

0.63 

NA 

+/+ 

NA 

NA 

NA 

Hypopharynx/pyriform  sinus 

M 

64 

3 

2 

0 

IV 

14 

0.91 

0.69 

NA 

+/- 

27 

2143 

0.013 

Oropharynx/BOT 

M 

49 

4 

0 

0 

IV 

15 

0.88 

0.66 

0.51 

+/- 

57 

2595 

0.022 

Larynx/supraglottis 

F 

54 

2 

1 

0 

III 

16 

0.9 

0.64 

0.75 

-/- 

54 

2237 

0.024 

Oral  cavity/tongue 

F 

78 

3 

2 

0 

IV 

17 

0.91 

0.77 

0.76 

+/+ 

114 

2105 

0.054 

Oropharynx/tonsil 

M 

47 

2 

2 

0 

IV 

18 

1.01 

0.57 

NA 

+/- 

66 

2105 

0.031 

Larynx/supra 

M 

56 

3 

0 

0 

III 

19 

0.79 

0.89 

NA 

+/- 

59 

2635 

0.022 

Larynx/glottis 

M 

54 

4 

0 

0 

IV 

20 

0.82 

0.59 

NA 

-/- 

109 

2582 

0.042 

Larynx/glottis 

M 

63 

3 

0 

0 

III 

21 

0.85 

0.57 

NA 

NA 

NA 

NA 

NA 

Larynx/glottis 

M 

59 

2 

0 

0 

II 

22 

0.89 

0.7 

NA 

+/+ 

204 

2200 

0.093 

Hypopharynx/pyriformis 

M 

56 

2 

3 

0 

IV 

23 

0.91 

0.46 

NA 

+/- 

96 

2289 

0.042 

Hypopharynx/pyriformis 

M 

61 

3 

2 

0 

IV 

24 

0.74 

0.35 

NA 

-/- 

161 

2109 

0.076 

Hypopharynx/cricoid 

F 

62 

3 

1 

0 

III 

27 

0.9 

0.85 

NA 

+/- 

27 

2418 

0.011 

Oral  cavity/tongue 

M 

67 

2 

0 

0 

II 

28 

0.9 

0.6 

NA 

+/- 

94 

2486 

0.038 

Oropharynx/palate 

M 

69 

1 

0 

0 

1 

29 

0.91 

0.46 

NA 

-/- 

141 

2151 

0.066 

Oropharynx/tonsil 

F 

70 

2 

1 

0 

III 

30 

0.9 

0.62 

NA 

-/- 

244 

2315 

0.105 

Oral  cavity/tongue 

M 

30 

3 

0 

0 

III 

31 

0.91 

0.76 

NA 

-/- 

59 

2595 

0.023 

Oropharynx/BOT 

M 

51 

4 

0 

0 

IV 

PMR-nl,  PMR  of  normal  adjacent  tissue;  PMR-prim,  PMR  of  primary  tumor;  PMR-met,  PMR  of  lymph  node  metastasis.  HPV  status: 
+/+,  positive  at  both  DNA  level  and  for  E6  transcriptional  level;  +/-,  positive  at  the  DNA  level  but  transcriptionally  silent  for  E6; 
negative.  #LOH,  number  of  10K  SNPs  that  show  loss  of  heterozygosity;  #AB,  number  of  10K  SNPs  that  are  heterozygous  in  normal 
adjacent  tissue.  Tc/Nc/Mc,  clinical  T-N-M  stage. 
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Supplementary  Figure  Legends 

Figure  SI.  Dynamic  range  of  PMA  LINE-1  and  Alu  Assays.  Mixing  experiments  were 
performed  to  determine  PMA  methylation  levels  measured  with  varying  proportions  of 
universally  methylated  and  unmethylated  DNA.  UM.C  and  U2M.C  represent  commercially 
available  (Chemicon)  universally  methylated  and  unmethylated  DNAs,  respectively.  UM.L  and 
U2M.L  were  generated  from  the  same  DNA  by  in  vitro  modification  in  our  laboratory. 

Figure  S2.  Correlation  analysis  of  SINE  and  LINE  global  methylation  assays.  A  variety  of 
tumor  and  normal  cell  line  DNAs  were  assayed  with  all  six  of  our  LINE  and  SINE  assays  (three 
LINE-1  assays  and  three  Alu  assays).  A.  LINE-1  assays  were  highly  correlated  with  each  other. 
B.  Alu  assays  were  highly  correlated  with  each  other.  C.  However,  LINE-1  and  Alu  assays  were 
only  moderately  correlated.  Two  representative  plots  are  shown  for  each  series  of 
comparisons. 
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Translational  Relevance 


Brain  metastasis  occurs  in  up  to  60%  of  non-small  cell  lung  carcinomas  (NSCLC)  and 
there  is  little  information  on  the  molecular  differences  between  primary  tumor  and 
metastases.  Our  findings  indicate  that  NSCLC  brain  metastases  have  some  significant 
differences  in  HER  family  receptors-related  abnormalities  from  primary  lung  tumors. 
These  differences  could  be  related  to  tumor  progression  and  may  cause  diverse  responses 
to  EGFR-targeted  therapy  of  primary  and  metastatic  tumor  sites. 
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Abstract 


Purpose:  To  compare  the  characters  of  HER  receptors  and  their  ligands  deregulation 
between  primary  tumor  and  corresponding  brain  metastases  of  non-small  cell  lung 
carcinoma  (NSCLC). 

Experimental  design:  Fifty  five  surgically  resected  NSCLC  primary  tumors  and 
corresponding  brain  metastases  tumor  specimens  were  included  in  tissue  microarrays. 
Immunohistochemical  expression  of  EGFR  and  phosphorylated  (p)-EGFR,  their  related 
ligands  EGF,  transforming  growth  factor-a  (TGF-a),  and  amphiregulin,  and  the 
receptors  Her2,  Her3,  and  p-Her3  was  investigated.  EGFR  copy  number  using  fluorescent 
in  situ  hybridization  was  also  performed. 

Results.  Brain  metastases  showed  significantly  higher  immunohistochemical  expression 
of  p-EGFR  (membrane,  PO.OOOl;  and  cytoplasm,  P= 0.14  ),  EGF  (membrane,  P= 0.027; 
and  nucleus,  /3=0.008),  amphiregulin  (nucleus,  P=0S)\  9),  and  p-Her3  (membrane, 
P=0.001)  than  primary  tumors  did.  Primary  tumors  showed  significantly  higher 
expression  of  cytoplasmic  (/MX01)  TGF-a.  In  adenocarcinomas,  an  identical  high 
frequency  of  EGFR  copy  number  gain  (high  polysomy  and  amplification)  was  detected  in 
primary  and  brain  metastasis  (65%)  sites.  However,  adenocarcinoma  metastases  (30%) 
showed  higher  frequency  of  EGFR  amplification  than  corresponding  primary  tumors 
(13%).  Patients  whose  primary  tumors  showed  EGFR  amplification  tended  to  develop 
brain  metastases  at  an  earlier  time  points. 

Conclusions.  Our  findings  indicate  that  NSCLC  brain  metastases  have  some  significant 
differences  in  HER  family  receptors-related  abnormalities  from  primary  lung  tumors. 
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These  differences  could  be  related  to  tumor  progression  and  may  cause  the  diverse 
responses  to  EGFR-targeted  therapy  of  primary  and  metastatic  tumor  sites. 
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Introduction 


Lung  cancer  is  the  leading  cause  of  cancer-related  deaths  in  the  United  States  (1). 
Lung  cancer  includes  several  histological  types,  the  most  frequently  occurring  of  which 
(~80%)  are  two  types  of  non-small  cell  lung  carcinoma  (NSCLC),  adenocarcinoma  and 
squamous  cell  carcinoma  (2).  The  brain  is  one  of  the  main  sites  of  metastasis  in  patients 
with  lung  cancer:  brain  metastasis  has  an  incidence  of  up  to  60%  in  patients  with  lung 
adenocarcinoma  (3-6).  The  median  survival  for  lung  cancer  patients  with  brain  metastasis 
is  usually  3-6  months  (6,7).  The  use  of  systemic  chemotherapy  and  cranial  irradiation 
has  been  shown  to  be  unsuccessful  in  the  treatment  of  NSCLC  brain  metastasis  (3,8),  and 
this  in  turn  has  motivated  the  search  for  new  therapeutic  strategies  for  this  disease. 

During  the  past  few  years,  significant  advances  have  been  made  in  the 
development  of  new  molecularly  targeted  agents  for  lung  cancer  (9).  One  example  of 
such  targets  is  the  epidermal  growth  factor  receptor  (EGFR)  that  belongs  to  the  HER 
family  tyrosine  kinase  (TK)  receptors  composed  of  four  homologous  cell  membrane 
receptors,  including  Her2  and  Her3  (10).  These  receptors  are  activated  by  seven  known 
ligands,  including,  EGF,  transforming  growth  factor  alpha  (TGF-a),  and  amphiregulin 
(11).  Deregulation  of  HER  receptors,  especially  EGFR,  appears  to  play  an  important  role 
in  the  pathogenesis  and  progression  of  NSCLC  (12).  In  lung  cancer  cells,  the  constitutive 
activation  of  EGFR  is  achieved  by  several  mechanisms,  including  increased  production 
of  ligands,  increased  levels  of  the  receptor,  and  mutation  of  EGFR  (12-14). 

Small-molecule  inhibitors  that  target  the  TK  domain  of  the  EGFR  produce  tumor 
responses  in  approximately  10%  of  patients  with  advanced  NSCLC  that  has  progressed 
despite  prior  chemotherapy  (12,15).  However,  the  brain  is  still  a  frequent  site  of  disease 
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recurrence  in  NSCLC  patients  after  an  initial  response  to  treatment  with  EGFR  TK 
inhibitors  (TKIs),  regardless  of  disease  control  in  the  lungs  (4).  Activating  mutations  in 
the  EGFR  TK  domain,  an  increased  EGFR  copy  number,  and  increased  EGFR  protein 
expression  have  been  associated  with  a  favorable  response  to  treatment  with  EGFR  TKIs 
(12,15).  Previous  reports  demonstrated  that  metastatic  NSCLC  brain  tumors  respond  to 
EGFR  TKIs  (16,17).  However,  it  is  still  unclear  whether  EGFR  and  its  related  receptors’ 
abnormalities  differ  in  metastases  compared  with  primary  NSCLC  tumors. 

We  investigated  the  immunohistochemical  expression  of  EGFR  and  related 
ligands  and  receptors  in  55  paired  primary  and  brain  metastasis  NSCLCs.  In  addition,  we 
compared  EGFR  copy  number  abnormalities  using  fluorescent  in  situ  hybridization 
(FISH)  analysis.  The  study  was  performed  in  tissue  microarrays  (TMAs)  prepared  from 
surgically  resected  NSCLC  specimens. 
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Material  and  Methods 


NSCLC  tissue  specimens.  We  obtained  archived  formalin-fixed  paraffin-embedded 
(FFPE)  material  from  surgically  resected  specimens  from  55  NSCLC  patients  with 
primary  lung  cancers  and  corresponding  brain  metastases  containing  tumor  tissues.  These 
cases  were  selected  based  on  the  availability  of  enough  archival  tissue  for  the 
immunohistochemistry  and  FISH  analyses.  All  specimens  were  from  the  Lung  Cancer 
tissue  bank  at  The  University  of  Texas  M.  D.  Anderson  Cancer  Center  (Houston,  TX) 
and  had  been  collected  between  1988  and  2002.  The  study  was  approved  by  the  M.  D. 
Anderson  institutional  review  board.  After  histologic  examination,  TMAs  were 
constructed  using  three  1-mm-diameter  cores  per  tumor. 

Detailed  clinical  and  pathological  information — including  demographic, 
pathologic  TNM  staging,  overall  survival,  and  time  of  brain  metastasis  occurrence — was 
obtained  for  all  patients  (Table  1).  Pathologic  TNM  stage  had  been  determined  for  lung 
cancers  according  to  the  revised  International  System  for  Staging  Lung  Cancer  (18)  at 
time  of  primary  tumor  surgery  with  curative  intent.  In  all  cases  the  NSCLC  brain 
metastases  were  solitary,  and  1 1  patients  also  developed  metastases  at  other  brain  sites 
(median,  13  months;  range,  <1-94  months)  over  a  median  period  of  12  months  (range, 
<1-27  months).  Forty-four  (80%)  of  55  patients  developed  clinically  detectable  brain 
metastases  after  primary  lung  cancer  surgical  resection  (median,  13  months;  range,  <1-94 
months);  in  1 1  (20%)  patients  the  brain  metastases  were  detected  at  the  same  time  as  the 
lung  tumors,  and  they  were  surgically  removed  before  (median,  less  <1  month;  range, 
<1-1 1  months)  the  primary  lung  cancer  surgery. 
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Immunohistochemical  staining  and  evaluation.  For  our  analysis,  antibodies  against  the 
following  molecules  were  purchased  and  used:  EGF  (EMD  Biosciences;  San  Diego,  CA; 
dilution  1:50),  amphiregulin  (Lab  Vision,  Freemont,  CA;  dilution  1:150),  TGF-a  (EMD 
Biosciences;  dilution  1:150),  EGFR  (Zymed,  Carlsbad,  CA;  clone  31G7;  dilution  1:100), 
phosphorylated  (p)-EGFR  Tyr  1086  (Invitrogen,  Carlsbad,  CA,  dilution  1:100),  Her2 
(Dako,  San  Diego,  CA;  dilution  1:100),  Her3  (GenTex;  San  Antonio,  TX;  dilution  1:50), 
and  p-Her3  (Cell  Signaling,  Danvers,  MA;  dilution  1:100).  Immunohistochemical 
staining  was  performed  using  5-pg-thick  TMA  histologic  sections  as  previously 
described  (19,22).  The  immunohistochemical  protein  expression  was  quantified,  using 
white  light  microscopy  with  x20  magnification,  by  two  experienced  thoracic  pathologists 
(M.S.  and  I.W.)  blinded  to  clinical  and  other  molecular  variables.  All  markers  were 
examined  for  membrane,  cytoplasm,  and  nucleus  localization  in  tumor  cells.  As 
previously  described  (20-22),  immunohistochemical  expression  was  quantified  using  a 
three-value  intensity  score  (0,  1+,  2+,  and  3+)  for  all  markers,  except  for  membrane 
EGFR  and  p-EGFR,  for  which  a  four- value  intensity  score  (0,  1+,  2+,  3+,  and  4+),  and 
the  percentage  (0%  to  100%)  of  the  extent  of  reactivity  were  used.  Next,  expression 
scores  were  obtained  by  multiplying  the  intensity  and  reactivity  extension  values  (range, 
0-300  for  all  markers,  except  for  membrane  EGFR  and  p-EGFR  with  a  range  of  0-400). 

EGFR  FISH  analysis.  We  analyzed  the  gene  copy  number  per  cell  using  the  LSI  EGFR 
SpectrumOrange/CEP  7  SpectrumGreen  Probe  (Abbott  Molecular,  Des  Plaines,  IL),  as 
previously  described  (23,24).  Tumor  specimens  were  classified  into  six  FISH  strata 
according  to  the  frequency  of  cells  with  each  EGFR  gene  copy  number  and  referred  to 


Page  8 


the  chromosome  7  centromere,  as  follows:  (1)  disomy  (three  or  four  copies  in  <10%  of 
cells);  (2)  low  trisomy  (three  copies  in  10%  to  <40%  of  cells  and  four  copies  in  <10%  of 
cells);  (3)  high  trisomy  (three  copies  in  >40%  of  cells  and  four  copies  in  <10%  of  cells); 
(4)  low  polysomy  (four  copies  in  10%  to  <40%  of  cells);  (5)  high  polysomy  (four  or 
more  copies  in  >40%  of  cells);  and  (6)  gene  amplification  (presence  of  loose  or  tight 
EGFR  gene  clusters  with  >4  copies,  EGFR  gene  to  CEP  7  ratio  >  2,  or  15  copies  of 
EGFR  per  cell  in  >10%  of  cells).  The  high  polysomy  and  gene  amplification  categories 
were  considered  to  indicate  high  EGFR  copy  number  ( EGFR  FISH  positive),  and  the 
other  categories  were  considered  to  indicate  no  significant  increase  in  the  EGFR  copy 
number  ( EGFR  FISH  negative),  as  previously  described  (23,24). 

Statistical  analysis.  Data  were  summarized  using  standard  descriptive  statistics  and 
frequency  tabulations.  Associations  between  the  marker  expression  and  patients’  clinical 
demographical  variables,  including  age,  sex,  histology  type,  and  pathologic  stage,  were 
assessed  using  appropriate  methods,  including  the  chi-squared  test  or  Fisher’s  exact  test 
for  categorical  variables  and  the  Wilcoxon  rank  sum  test  or  the  Kruskal-Wallis  test  for 
continuous  variables.  The  Wilcoxon  signed  rank  test  was  used  to  test  the  differences  in 
biomarker  expression  between  primary  lung  tumors  and  brain  metastases.  Cox 
proportional  hazard  models  were  used  for  univariate  analysis  of  time  to  metastasis 
according  to  biomarker  expression.  Hazard  ratios  with  95%  confidence  intervals  and  P 
values  are  reported.  All  tests  were  two-sided.  P  values  smaller  than  0.05  were  considered 
to  indicate  statistical  significance. 
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Results 


Immunohistochemical  expression  of  HER  receptors  and  ligands  in  NSCLC  primary 
tumors  and  corresponding  brain  metastases.  All  eight  markers,  including  the  ligands 
EGF,  amphiregulin  and  TGF-a,  and  the  receptors  EGFR,  p-EGFR,  Her2,  Her3,  and  p- 
Her3  showed  protein  expression  in  tumor  cells  from  primary  and  metastasis  sites  at  the 
membrane  and  cytoplasm  levels  (Figure  1).  Of  those,  amphiregulin,  EGF,  p-EGFR,  and 
p-Her3  showed  also  nuclear  expression  in  malignant  cells  (Figure  1).  Although  showing 
overlapping,  brain  metastases  had  significantly  higher  immunohistochemical  expression 
scores  of  EGF  (membrane  and  nucleus),  amphiregulin  (nucleus),  p-EGFR  (membrane 
and  cytoplasm),  and  p-Her3  (membrane)  than  did  corresponding  primary  tumors  (Figure 
2  and  Supplementary  Table  1).  Only  the  protein  expression  score  of  TGF-a.  at  the 
cytoplasmic  level  was  significantly  higher  in  malignant  cells  from  primary  tumors  than  in 
brain  metastasis  cells  (Figure  2  and  Supplementary  Table  1). 

EGFR  copy  number  analysis  by  FISH  in  NSCLC  primary  tumors  and 
corresponding  brain  metastases.  Overall,  the  presence  of  high  frequency  of  gain  in 
EGFR  copy  number  (FISH  positive:  high  polysomy  and  amplification;  Figure  3)  was 
similar  in  NSCLC  primary  (34/55  [62%])  and  brain  metastasis  (36/55  [65%])  sites  (Table 
2).  Although  a  relatively  lower  frequency  of  high  polysomy  was  detected  in  metastases 
than  in  primary  tumors  (38  %  versus  45%),  brain  metastases  showed  a  nonsignificant 
higher  frequency  of  EGFR  amplification  than  corresponding  primary  tumors  did  (31 
versus  16%,  P  =  0.53).  In  adenocarcinomas  (n=40  cases),  an  identical  frequency  (65%)  of 
gain  in  EGFR  copy  number  was  detected  in  primary  tumors  and  corresponding 


Page  10 


metastases.  However,  brain  metastases  of  lung  adenocarcinoma  showed  a  nonsignificant 
higher  frequency  of  EGFR  amplification  than  primary  lung  tumors  (30%  versus  13%,  P  = 
0.53).  Although  a  higher  frequency  gain  in  EGFR  copy  number  gain  was  detected  in 
brain  metastases  among  squamous  cell  carcinomas,  the  data  were  difficult  to  interpret 
because  of  the  small  number  of  cases  available  for  analysis. 

A  relatively  high  level  of  concordance  (45/55  [82%])  for  gain  in  EGFR  copy 
number  gain  was  found  between  primary  tumors  and  metastases  (Supplementary  Table 
2).  Fifteen  (27%)  paired  primary/metastasis  cases  were  EGFR  FISH  negative  in  both 
sites,  whereas  30  (55%)  paired  cases  showed  gain  in  EGFR  copy  number  at  both  tumor 
sites.  Discordance  in  EGFR  copy  number  status  was  detected  in  10  cases  (18%);  in  7  of 
these,  brain  metastasis  sites  had  a  gain  in  copy  number  while  primary  tumors  were  FISH 
negative.  The  levels  of  concordance  for  high  polysomy  (14/31  [45%])  and  amplification 
(6/19  [32%])  were  low  when  primary  tumors  and  corresponding  brain  metastases  were 
compared. 

Two  consecutive  brain  metastasis  samples  were  available  for  analysis  from  each 
of  five  adenocarcinoma  cases.  In  all  comparisons,  paired  consecutive  brain  metastasis 
specimens  showed  identical  EGFR  copy  number  status.  One  pair  was  FISH  negative  and 
the  other  four  were  FISH  positive.  FISH-positive  specimens  included  two  pairs  showing 
EGFR  high  polysomy  and  two  showing  gene  amplification. 

We  correlated  EGFR  copy  number  status  with  markers’  immunohistochemical 
expression  at  both  tumor  sites.  The  only  associations  detected  were  that  EGFR  FISH- 
positive  primary  tumors  and  brain  metastases  demonstrated  significantly  higher  protein 
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expression  scores  of  p-EGFR  (P=0.018)  and  Her2  (P=0.015),  respectively,  than  FISH 
negative  tumors. 

Correlation  between  immunohistochemical  expression  of  markers  and  EGFR  copy 
number  and  time  to  brain  metastasis.  We  investigated  the  correlation  between  the 
immunohistochemical  expression  of  the  markers  examined  and  EGFR  copy  number 
abnormalities  in  primary  lung  tumors  and  the  time  to  brain  metastasis  development.  In 
this  analysis,  we  included  only  the  44  patients  whose  brain  metastases  were  diagnosed 
after  surgical  resection  of  the  primary  tumor.  Overall,  the  median  time  to  brain  metastasis 
for  all  44  patients  was  1.23  years  (95%  Confidence  Interval  [Cl],  0.89-1.62  years).  The 
median  time  to  brain  metastasis  development  for  patients  with  adenocarcinoma  was  1 .43 
years  (95%  Cl,  0.96-2.04  years),  and  that  for  patients  with  squamous  cell  carcinoma  was 
0.89  years  (95%  Cl,  0.63  to  not  available)  Using  the  Cox  proportional  hazard  regression 
models,  we  identified  that  adenocarcinoma,  compared  with  squamous  cell  carcinoma, 
was  significantly  correlated  with  a  longer  time  to  brain  metastasis  occurrence  (P=0.009; 
HR,  0.347;  95%  Cl,  0.157-0.769),  whereas  EGFR  protein  expression  scores  (P=0.025; 
HR,  1.003;  95%  Cl,  1.000-1.006)  and  EGFR  amplification  (vs.  no-amplification)  were 
significantly  correlated  (P=0.0039;  HR,  3.492;  95%  Cl,  1.494-8.162)  with  a  shorter  time 
to  brain  metastasis  development.  None  of  these  markers  were  demonstrated  in  the 
multivariate  analysis  to  be  statistically  significant  predictors  of  metastasis  development. 
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Discussion 


In  NSCLC,  overexpression  and  activation  of  EGFR,  Her2,  and  Her3  are  well 
known  phenomena  (10,12).  However,  to  the  best  of  our  knowledge,  the  overexpression  of 
those  TK  receptors  has  not  been  previously  reported  in  NSCLC  brain  metastasis.  In  this 
study,  we  have  described  for  the  first  time  higher  levels  of  immunohistochemical 
expression  of  EGFR,  p-EGFR,  Her2,  Her3,  and  p-Her3  in  a  series  of  NSCLC  brain 
metastases.  Interestingly,  we  found  that  the  expression  of  phosphorylated  forms  of  EGFR 
and  Her3  proteins  at  the  cytoplasmic  and  membrane  level  of  malignant  cells  was 
significantly  increased  in  brain  metastasis  compared  with  expression  in  corresponding 
primary  lung  tumors.  These  findings  are  consistent  with  the  notion  that  activation  of  the 
EGFR  and  Her3  pathways  is  important  in  the  progression  and  metastasis  of  lung  cancer 
(12,15).  Similarly  to  brain  metastasis,  we  recently  showed  that  in  EGFR  mutant  lung 
adenocarcinomas,  p-EGFR  immunohistochemical  expression  was  significantly  increased 
in  nine  lymph  node  metastases  compared  with  expression  in  corresponding  primary 
tumors  (19). 

It  is  known  that  the  receptors  of  the  HER  family  are  activated  after  binding  to 
ligands  or  peptide  growth  factors,  which  include  EGF,  amphiregulin,  and  TGF-  a  (1 1).  Of 
these,  TGF-  a  (25,26)  and  amphiregulin  (27)  have  been  shown  to  be  frequently  expressed 
in  primary  NSCLC  tumors,  and  EGF  has  been  shown  to  expressed  in  NSCLC  cell  lines 
(28).  We  found  that,  compared  with  the  corresponding  primary  tumors,  NSCLC  brain 
metastases  had  significantly  higher  immunohistochemical  expression  of  membrane  and 
nuclear  EGF  and  of  nuclear  amphiregulin — ligands  associated  with  activation  of  EGFR 
dimmers  (11).  These  findings  are  consistent  with  the  concomitant  high  level  of 
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overexpression  of  p-EGFR  in  the  NSCLC  brain  metastasis  that  we  studied  and  indicate 
the  presence  of  an  autocrine  secretion  mechanism  of  these  ligands.  In  contrast  to  EGF 
and  amphiregulin,  the  cytoplasmic  expression  of  TGF-  a. another  EGFR  ligand  (11),  was 
significantly  higher  in  malignant  cells  from  primary  tumors  than  in  cells  from  brain 
metastases.  Overexpression  of  TGF-a  has  been  associated  with  the  metastatic  potential  of 
NSCLC  (28)  and  colon  cancer  (29)  cell  lines  in  favoring  modifications  of  the  tumor 
microenvironment  conducive  to  metastasis,  such  as  increasing  angiogenesis. 

In  our  study,  we  have  identified  that  two  of  ligands,  amphiregulin  and  EGF,  and 
two  receptors,  p-EGFR  and  p-Her3,  had  nuclear  expression  in  malignant  NSCLC  cells. 
There  is  evidence  that  TK  receptors,  as  well  as  their  ligands,  translocate  into  the  nucleus 
via  receptor-mediated  endocytosis  for  degradation  or  to  be  recycled  back  to  the  cell 
surface  (30-34).  However,  it  now  seems  clear  that  these  complexes  reach  into  the  cell 
nucleus  where  participate  directly  in  the  control  of  cell  proliferation,  cell  differentiation, 
and  cell  survival  (34). 

The  current  concept  of  metastasis  development  states  that  metastases  are  the 
result  of  tumor  cells  interacting  with  a  specific  organ  microenvironment,  also  called  the 
“seed  and  soil”  hypothesis  (35).  Thus,  the  microenvironments  of  different  organs, 
including  the  brain,  are  biologically  unique  and  can  explain  the  expression  of  HER 
receptors  and  ligands  in  the  brain  metastasis  tissue  specimens  differing  from  expression 
in  the  corresponding  primary  lung  tumors.  In  addition,  these  observations  have  important 
implications  for  the  development  of  molecularly  targeted  therapy  in  lung  cancer  patients. 
The  fact  that  potential  therapeutic  targets  (amphiregulin,  EGF,  TGF-a,  EGFR,  and  Her3 
are  expressed  differently  in  metastases  from  corresponding  primary  tumors  suggests  that 
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different  molecular  properties  among  tumor  sites  may  influence  differing  responses  to 
treatment  and  affect  the  levels  of  biomarkers  that  may  be  predictive  of  the  response  to 
treatment.  Although  immunohistochemical  testing  of  EGFR  has  been  shown  not  to  be  an 
optimal  method  for  identifying  patients  who  may  respond  to  treatment  with  anti-EGFR 
drugs  (15),  there  are  preliminary  data  suggesting  that  the  expression  in  tumor  tissue  of 
Her3  (14)  amphiregulin,  and  TGF-  a  (27)  correlates  with  sensitivity  and  resistance  to 
EGFR  TKI  therapy.  The  immunohistochemical  overexpression  of  Her3  in  NSCLC  tissue 
specimens  has  been  correlated  with  EGFR  TKI  sensitivity  (14).  In  contrast,  increased 
expression  of  amphiregulin  and  TGF-  a  has  been  correlated  with  resistance  to  such 
therapy  (27). 

An  increase  in  EGFR  gene  copy  number,  including  high  polysomy  and  gene 
amplification  (as  shown  by  FISH),  has  been  detected  in  22%  of  patients  with  surgically 
resected  (stages  I-IIIA)  NSCLC  (20).  Higher  frequencies  (40%-50%)  of  EGFR  high 
copy  number  have  been  reported  in  patients  with  more-advanced  metastatic  NSCLC 
(stage  IV)  (36-40)  In  the  present  study,  we  have  identified  even  a  higher  frequency  (62%) 
of  gain  in  EGFR  copy  number  in  surgically  resected  primary  NSCLC  specimens  from 
patients  who  developed  brain  metastases.  Recently,  we  reported  that  a  gain  in  EGFR  gene 
copy  number  was  detected  in  74%  of  primary  NSCLC  tumors  from  patients  who 
developed  brain  metastasis  (24).  Altogether,  these  data  suggest  a  stepwise  increase  in  the 
frequency  of  gain  in  EGFR  copy  number  in  primary  tumors  with  increasing  tumor  stage 
and,  more  important,  with  the  development  of  brain  metastasis.  Interestingly,  in  our  cases 
the  presence  of  EGFR  amplification,  along  with  membrane  EGFR  protein 
overexpression,  was  significantly  correlated  with  shorter  time  to  brain  metastasis 
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development  in  the  univariate  analysis,  further  suggesting  the  important  role  of  this 
genetic  abnormality  in  the  progression  and  metastasis  of  NSCLC. 

Recently,  we  (19)  and  others  (41)  have  shown  that  EGFR  copy  number  gain,  and 
specifically  gene  amplification,  is  a  late  phenomenon  in  the  development  of  lung 
adenocarcinoma,  appearing  at  invasive  tumor  stages  and  progressing  in  lymph  node 
metastases,  and  that  it  is  preceded  by  gene  mutation.  In  the  present  study,  we  have 
expanded  some  of  these  observations  to  NSCLC  brain  metastasis.  Although  it  was  not 
statistically  significant,  we  found  that  brain  metastases  of  lung  adenocarcinomas  had  a 
higher  frequency  of  EGFR  amplification  than  the  corresponding  primary  tumors  (30% 
versus  13%).  Although  a  relatively  high  level  (82%)  of  concordance  for  gain  in  EGFR 
copy  number  (when  high  polysomy  and  amplification  were  analyzed  together)  was 
detected  when  primary  tumors  and  metastases  were  compared,  there  were  10  discordant 
cases  (18%),  including  7  brain  metastases  that  had  increased  copy  numbers  while  primary 
tumors  did  not.  In  contrast,  we  found  that  EGFR  gene  amplification  had  a  low  level  of 
concordance  (32%)  when  primary  and  metastatic  tumors  were  compared,  indicating  a 
high  level  of  heterogeneity  for  this  phenomenon.  The  distinct  rate  of  EGFR  gene 
amplification  between  primary  tumors  and  corresponding  brain  metastases  may  support 
the  influence  of  this  phenomenon  on  differing  responses  to  treatment  and  may  impact  the 
assessment  of  this  specific  biomarker  for  anti-EGFR  therapy. 

In  summary,  our  findings  indicate  that  NSCLC  brain  metastases  exhibit  important 
differences  in  abnormalities  related  to  the  HER  family  receptors  from  primary  lung 
tumors.  These  differences  may  cause  different  responses  to  EGFR-targeted  therapy  of 
primary  and  metastatic  tumor  sites.  Although  our  series  of  cases  is  relatively  small  and 
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restricted  to  one  metastatic  site  per  patient,  the  data  strongly  suggest  that  the  analysis  of 
both  primary  and  metastasis  tumor  sites  may  be  critical  for  the  identification  of  novel 
therapeutic  targets  and  corresponding  predictive  biomarkers  in  lung  cancer. 
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Figure  Legends 

Figure  1.  Representative  microphotographs  of  immunohistochemical  expression  of  HER 
receptors  (EGFR,  p-EGFR,  Her2,  Her3,  and  p-Her3)  and  ligands  (EGF,  TFG-a,  and 
amphiregulin)  in  primary  tumors  and  corresponding  brain  metastases  (magnification, 
x400).  All  markers  showed  protein  expression  (brown  staining)  in  tumor  cells  from 
primary  and/or  metastasis  sites  at  the  membrane  and  cytoplasm  levels.  In  this 
microphotographs,  amphiregulin  and  EGF  showed  also  nuclear  expression  in  malignant 
cells. 

Figure  2.  Box  plots  showing  scores  of  immunohistochemical  expression  of  multiple 
markers  comparing  primary  tumors  with  corresponding  brain  metastases.  Panel  a  and  b, 
EGF  membrane  and  nuclear  scores.  Panel  c,  amphiregulin  nuclear  score.  Panel  d,  TGF-a 
cytoplasmic  score.  Panel  e,  p-EGFR  membrane  and  cytoplasmic  scores.  Panel/  p-Her3 
membrane  score.  P  values  comparing  normal  epithelial  and  tumor  histologic  types  are 
shown  for  all  comparisons. 

Figure  3.  Representative  microphotographs  of  FISH  showing  EGFR  copy  number  in 
primary  tumors  (PT)  and  corresponding  brain  metastases  (BM)  (magnification,  xlOOO). 
Red  signals  (red  arrows)  represent  EGFR  gene  copies  and  green  signals  (white  arrows) 
represent  the  chromosome  7  centromere  probe.  Cell  nuclei  stained  blue  with  DAPI. 
High  polysomy  is  defined  by  >4  copies  in  >40%  of  cells,  and  gene  amplification  by  the 
presence  of  loose  or  tight  EGFR  gene  clusters  and  a  ratio  of  EGFR  gene  to  chromosome 
of  2  or  15  copies  of  EGFR  per  cell  in  10%  of  the  analyzed  cells. 
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Table  1.  Summary  of  clinicopathologic  features  of  55  NSCLC  patients  with  primary 


tumors  and  corresponding  brain  metastases. 


Characteristic 

Number 

% 

Tumor  Histology 

Ademocarcinoma 

40 

73 

Squamous  cell  carcinoma 

13 

23 

Large  cell  carcinoma 

1 

2 

Adenosquamous  carcinoma 

1 

2 

Age 

<60  years 

30 

55 

>60  years 

25 

45 

Gender 

Female 

19 

35 

Male 

36 

65 

Pathological  Stage1 

I 

13 

23 

II 

10 

18 

III 

17 

31 

IV2 

1  rpi  .  r*  i  j  j  •  r* 

13 

•  i  •  r* 

23 

1  The  staging  was  performed  at  time  of  surgical  resection  of  the  primary  lung  tumor. 

2  In  1 1  cases  the  brain  metastases  were  surgically  removed  before  the  primary  tumor. 
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Table  2.  EGFR  copy  number  by  FISH  in  55  NSCLC  primary  and  corresponding  brain 
metastases  by  tumor  histology. _ 


Copy  Number 
Categories 

Adenocarcinoma 
(n= 40) 

Squamous  Cell  Ca 
(«=13) 

Total 
(n= 55)1 

Primary 

Metastasis 

Primary 

Metastasis 

Primary 

Metastasis 

n  (%) 

n  (%) 

n  (%) 

n  (%) 

n  (%) 

n  (%) 

FISH  Negative 

14(35) 

14  (35) 

7(54) 

5  (38) 

21  (38) 

19  (35) 

Disomy 

1(3) 

2(5) 

0(0) 

0(0) 

1(2) 

2(4) 

Trisomy 

2(5) 

2(5) 

1(7) 

0(0) 

3(5) 

2(4) 

Low  Polysomy 

11  (28) 

10  (25) 

6(46) 

5  (38) 

17(31) 

15  (27) 

FISH  Positive 

26  (65) 

26 (65) 

6(46) 

8(62) 

34  (62) 

36  (65) 

High  Polysomy 

21  (53) 

14  (35) 

2(14) 

5  (38) 

25  (45) 

21  (38) 

Amplification 

^ - 

5(13) 

12  (30) 

4(31) 

1 1 

3  (23) 

9(16) 

1  n  7— t  i 

17(31) 

1  One  adenosquamous  carcinoma  and  one  large  cell  carcinoma  showed  EGFR  high 
polysomy  in  the  primary  tumors  and  amplification  in  the  brain  metastasis  specimen. 
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Summary.  We  propose  a  new  Bayesian  adaptive  randomization  scheme  based  on  the 
predictive  probability  in  phase  II  trials.  We  illustrate  the  proposed  method  using  a  phase  II 
melanoma  clinical  trial,  and  conduct  extensive  simulation  studies  to  examine  the  operating 
characteristics  of  the  design. 

Key  WORDS:  Adaptive  design;  Bayesian  inference;  Phase  II  trial. 

1  Introduction 

Assume  that  we  compare  K  treatments  in  a  K- arm  phase  II  trial.  Let  pk  be  the  response 
rate  of  drug  k,  and  we  assign  pk  a  prior  distribution  of  Beta  (ak,/3k),  for  k  =  1, ...  ,K.  Given 
the  observed  data  yk  responses  out  of  nk  subjects,  Yk  ~  Biiiomial(rp:,pfc),  the  posterior 
distribution  of  pk  is 

Pk\Yk  =  Vk~  Beta (ak  +  yk,  f3k  +  nk  -  yk). 

If  the  maximum  sample  size  is  Nk,  then  the  number  of  responses  in  the  future  Nk—nk  patients, 
Xk,  follows  a  beta-binomial  distribution  Bcta-Binomiah  Ay  —  nk,  ak+yk,  f3k  +  nk  —  yk).  When 
Xk  =  xk,  the  posterior  probability  of  the  response  rate, 

Pk\(Yk  =  yk,Xk  =  xk)  ~  Beta(afc  +  yk  +  xk,f3k  +  Nk-yk-  xk). 
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Considering  only  two  treatments,  K  =  2,  we  compute  the  predictive  probability 

Ni—ni  N2  — ri2 

pp2  =  E  E  =  * il*i  =  Vi)P(X2  =  x2\Y2  =  y2 ) 

a:  1=0  X2=0 

x/{P(p2  >  pi | hi  =  yi,  X-!  =  xuY2  =  y2,X 2  =  x2)  >  #}, 

and  PPi  =  1  —  PP2.  Note  A£.  is  unknown  due  to  adaptive  randomization,  while  nk  is  known 
based  on  the  observed  data.  Let  the  number  of  total  remaining  subjects  be  m  =  N\  +  N2  — 
n\  —  n2,  which  is  known.  Then  we  approximate  Nk  by  m  x  P(p2  >  pi|hi  =  yi,  Y2  —  y2),  for 
k  =  1,2,  where  P(p2  >  p\  |h'j  =  yi,Y2  =  y2)  is  the  posterior  probability.  We  assign  a  new 
subject  to  treatment  2  with  probability  PP2- 

We  can  also  use  a  Bayesian  hierarchical  model  for  efficacy,  which  allows  us  to  borrow 
information  across  multiple  treatment  arms.  Let  (pi, . . .  ,px)  denote  the  response  rates 
corresponding  to  the  K  treatments,  and  assume  that  among  nk  patients  treated  in  arm  k,  yk 
subjects  have  experienced  efficacy.  We  model  efficacy  using  the  Bayesian  hierarchical  model: 

yk  ~  Binomial(nfc,pfc) 

pk  -  Beta(C,£)  (1) 

(  ~  Gamma(0.01, 0.01) 

£  ~  Gamma(0.01, 0.01) 

where  Bi(nk,Pk)  denotes  a  binomial  distribution,  and  Be(£,£)  denotes  a  beta  distribution 
with  a  shape  parameter  £  and  a  scale  parameter  £.  We  take  vague  gamma  prior  distributions 
GV/fO.Ol,  0.01)  with  mean  one,  for  £  and  £  to  ensure  that  the  data  will  dominate  the  posterior 
distribution.  The  full  conditional  distribution  of  pk  follows  Be(£  +  Vk ■.  £  +  'r>k  ~  Uk )  ■.  but  those 
of  £  and  £  do  not  have  closed  forms.  We  sample  from  their  conditional  distributions  using 
the  adaptive  rejection  Metropolis  method  (Gilks,  Best  and  Tan,  1995).  We  continuously 
update  the  posterior  estimates  of  the  pk  s  under  the  model  (1)  based  on  accumulated  data, 
and  adaptively  randomize  new  patients  to  efficacious  arms. 


2 


Once  the  adaptive  randomization  is  in  effect,  the  maximum  numbers  of  subjects  in  arm 
1  and  arm  2  are  random.  Let  the  remaining  subjects  in  the  trial  be  m  which  is  fixed,  and 
let  Z  be  the  number  of  subjects  that  would  be  assigned  to  arm  2.  If  we  denote  the  posterior 
probability 

7T  =  P(p2  >  Pi\D)  =  P(p2  >  Pi\Yi  =  2/1,  Y2  =  y2), 
then  Z  Bi(m,  7 r),  i.e., 

PZ(Z\D)  =  (^yz(i-7r)m-z. 

We  would  claim  arm  2  as  a  superior  treatment  arm  if 


7T  =  P(p2  >  pi\D)  >  9, 

where  6  can  be  calibrated  by  simulation  studies. 

We  can  average  over  Z,  and  then  conditional  on  Z  =  z  average  over  X1  and  X2.  Therefore, 
we  compute  the  predictive  probability 

m  m—z  z 

PP2  =  EEE  Pz{z\D)P(X1  =  xi|  W  =  yi,m-Z  =  m-z) 

z= 0  x\=0  X2  — 0 

xP(X2  =  x2\Y2  =  y2,Z  =  z)I{P(p2  >  pi|Yi  =  yi,Xi  =  x1  ,Y2  =  y2,X 2  =  x2)  >  9 }, 

m  m—z  z  /  \ 

=  El  ( ™)nz(l-n)m-zP(X1=x1\Y1  =  y1,m-Z  =  m-z) 

Z=0  tCl=0  X2=0  ' 

xP(X2  =  x2\Y2  =  y2,Z  =  z)I{P(p2  >  Pi|w  =  yi,Xi  =  x1  ,Y2  =  y2,X 2  =  x2)  >  9}, 

m  m—z  z  /  \ 

=  eee  :  {P{p2  >  p1\Y1  =  Vl,  Y2  =  y2)}z{  1  -  P(p2  >  p1\Y1  =  Vl ,Y2  =  y2)}m~z 

z=0  X l  =0  T2=0  ^  ' 

xP(X i  =  xi\Yi  =  yi,m  -  Z  =  m  -  z)P(X2  =  x2\Y2  =  y2,Z  =  z) 
xI{P(p2  >  pi\Yi  =  yi,Xi  =  x1,Y2  =  y2,X2  =  x2)  >  9 }, 


which  can  be  computationally  intensive. 
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Erlotinib  enhances  RADOOl’s  anticancer  activity  in  vivo  involving  abrogation  of 
mTOR  inhibition-induced  eIF4E  phosphorylation.  Songqing  Fan.  Xuerong  Wang.  Ping 
Yue,  John  Kauh,  Suresh  S.  Ramalingam,  Fadlo  R.  Khuri,  Shi-Yong  Sun.  Department  of 
Hematology  and  Medical  Oncology,  Winship  Cancer  Institute,  Emory  University  School  of 
Medicine,  Atlanta,  GA  30322. 

The  synergistic  anticancer  activity  between  an  mTOR  inhibitor  (e.g.  RAD001)  and  an 
EGFR  inhibitor  (e.g.,  erlotinib)  has  been  documented  in  several  studies  including  ours. 
However,  the  underlying  mechanisms  remain  unknown.  It  is  well  known  that  mTOR 
activates  cap-dependent  translation  initiation  by  preventing  4E-BP1  from  binding  to  eIF4E 
by  phosphorylating  4E-BP1.  Paradoxically,  mTOR  inhibitors  increase  eIF4E 
phosphorylation  while  inhibiting  mTOR  signaling  (Sun  et  al.  Cancer  Res,  2005).  As  a 
result,  mTOR  inhibitors’  effects  on  suppression  of  cap-dependent  translation  as  well  as  of 
cancer  cell  growth  may  be  compromised  (Wang  et  al,  Mol  Cell  Biol,  2007).  In  the  current 
study,  we  further  show  that  lung  cancer  xenografts  treated  with  RAD001  for  14  days 
exhibited  significantly  higher  levels  of  p-eIF4E  than  vehicle-treated  controls  although  the 
growth  of  these  xenografts  was  indeed  partially  inhibited,  thus  confirming  our  in  vitro 
finding  on  mTOR  inhibition-mediated  eIF4E  phosphorylation  in  vivo.  In  xenografts  treated 
with  the  combination  of  RAD001  and  erlotinib,  eIF4E  phosphorylation  was  not  increased 
compared  with  RAD001  -treated  tumors.  Accordingly,  the  combination  of  RAD001  and 
erlotinib  was  more  potent  than  each  agent  alone  in  inhibiting  the  growth  of  lung  cancer 
xenografts.  Moreover,  we  noted  that  the  expression  of  Mel- 1,  which  is  regulated  by  cap- 
dependent  translation  mechanism  and  important  for  eIF4E-mediated  oncogenesis,  was 
slightly  increased  in  RAD001  -treated  tumors,  but  significantly  inhibited  in  lung  tumors 
treated  with  the  combination  of  RAD001  and  erlotinib.  Collectively,  we  suggest  that 
erlotinib  enhances  RADOOl’s  anticancer  activity  in  vivo  involving  abrogation  of  mTOR 
inhibition-induced  eIF4E  phosphorylation.  Accordingly,  modulation  of  eIF4E 
phosphorylation  may  serve  as  a  predictive  biomarker  for  RADOOl/erlotinib-based  cancer 
therapy.  (S.  Fan  and  X.  Wang  share  first  authorship;  this  work  was  supported  by  NIH  ROl 
CA1 18450-01  and  POl  CA1 16676,  Georgia  Cancer  Coalition  Distinguished  Cancer 
Scholar  award,  DOD  IMPACT  award  W81XWH-05-0027  and  BATTLE  award 
W81XWH-06-1-0303;  SY  Sun,  SS  Ramalingam,  and  FR  Khuri  are  Georgia  Cancer 
Coalition  Distinguished  Cancer  Scholars) 


USING  AN  EDUCATIONAL  VIDEOGAME  TO  TACKLE  TOBACCO  USE 
AMONG  YOUTH 


A.V.  Prokhorov,  E.R.  Gritz,  S.  Marani 

Research  Design:  An  interactive,  tailored  educational  videogame  was  developed  for 
smoking  prevention  and  cessation  among  youth.  A  preliminary  study  was  conducted 
using  a  pretest-posttest  cohort  design  with  assessments  at  baseline,  7  days  and  6  months 
post- intervention.  Feasibility  and  impact  of  the  videogame  on  some  tobacco-related 
measures  at  6-month  follow-up  are  reported. 

Baseline  Demographic  Profile:  A  total  of  239  high-risk  alternative-school  students  were 
recruited.  Mean  age  of  the  participants  was  16.2  years  (SD=1.0),  79%  were  male.  Thirty- 
six  percent  of  the  participants  were  Hispanic  and  49%  were  African  American.  Twenty- 
five  percent  were  smokers. 

Six-Month  Feasibility  Analysis:  To  date  the  6-month  survey  has  been  completed  by  144 
participants.  All  participants  played  the  videogame  at  least  once  and  40%  played  twice  or 
more  times.  Over  85%  of  participants  reported  ease  of  use  of  this  educational  tool  and  the 
majority  of  participants  enjoyed  the  experience.  After  playing  the  videogame,  94% 
reported  increased  knowledge  about  the  tobacco  effects,  82%  were  inspired  never  to  start 
or  to  quit  and  82%  planned  to  share  the  videogame  with  family  or  friends. 

Mediating  Variables  of  Smoking  at  6-month  Follow-up:  The  primary  outcomes  of 
interest  in  this  study  were  the  mediating  determinants  of  smoking  initiation  and  cessation 
including  the  pros  and  cons  of  tobacco  use,  decisional  balance  and  temptations  to  smoke. 
These  outcomes  were  analyzed  using  mixed  model  regression  with  time  and  baseline 
smoking  status  and  their  interaction  as  fixed  effects.  There  was  as  a  significant  interaction 
effect  for  cons  of  smoking  (F  =  5.3;  p<.05),  decisional  balance  (F  =  8.0;  p<.01),  and 
temptations  to  smoke  (F  =  7.6;  p<.01).  For  baseline  smokers  6-month  temptations  to 
smoke  were  significantly  lower  than  baseline,  and  6-month  cons  of  smoking  were 
significantly  higher  than  baseline.  For  nonsmokers  these  variables  did  not  change 
significantly. 

The  educational  videogame  showed  considerable  promise  in  terms  of  increasing  tobacco 
knowledge  and  changing  attitudes  among  high-risk  youth. 
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Notch3  (N3)  is  a  member  of  the  family  of  Notch  transmembrane  receptors,  which  are  activated  by  ligands  such  as  Jaggedl 
(JAG1).  In  addition  to  roles  in  normal  cellular  differentiation  and  cell-fate  determination,  the  Notch  pathway  is  involved  in 
tumorigenesis.  N3  is  differentially  expressed  in  lung  cancer  and  sensitizes  tumors  to  EGFR  inhibition.  However,  the 
relationship  of  N3  and  JAG1  protein  expression  with  EGFR  has  not  been  studied  in  lung  cancer.  We  used 
immunohistochemical  analysis  in  tissue  microarrays  to  evaluate  N3,  JAG1,  EGFR  and  pEGFR  protein  expression  in  318 
non-small  cell  lung  carcinomas  (NSCLCs),  including  200  adenocarcinomas  (ADC)  and  118  squamous  cell  carcinomas 
(SCC).  We  correlated  these  findings  with  the  pathologic  features  of  the  tumors  and  the  clinical  characteristics  of  the  patients. 
We  also  investigated  possible  N3  rearrangement  and  amplification  using  FISH,  and  determined  the  mutational  status  of 
EGFR  in  a  subset  of  cases  (181  ADCs).  Cytoplasmic  staining  of  tumor  cells  for  N3  and  JAG1  was  detected  in  60%  (SCC 
56%;  ADC  62%)  and  95%  (SCC  99%;  ADC  93%)  of  NSCFCs,  respectively.  Staining  of  reactive  fibroblasts  for  N3  was 
noted  in  78%  of  tumors,  and  was  particularly  frequent  in  SCCs  (SCC  85%  vs.  ADC  74%,  p=0.03).  Staining  of  reactive 
endothelial  cells  for  JAG1  was  observed  in  60%  of  tumors,  and  was  also  significantly  higher  in  SCCs  (SCC  74%  vs.  ADC 
52%;  /;=(). 0002).  N3  amplification  was  detected  in  20  (8%)  of  NSCFCs  being  significantly  (p= 0.02)  higher  in  SCCs,  but  did 
not  correlate  with  N3  protein  staining  within  tumor  cells.  Tumor  cell  staining  for  N3  was  associated  with  female  gender 
(p=Q.Q4),  and  N3  amplification  and  staining  in  stromal  cells  were  significantly  higher  in  ever  smokers  compared  to  never 
smokers  (p= 0.0  and  0.02,  respectively).  In  NSCFCs,  pEGFR  nuclear  staining  was  associated  with  JAG1  cytoplasmic 
staining  (p= 0.02),  whereas  in  ADCs,  EGFR  mutations  were  strongly  associated  with  higher  levels  of  staining  for  N3 
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(p=0.0001).  Our  findings  indicate  that  N3-JAG1  proteins  are  frequently  expressed  NSCLC.  N3  amplification  is  associated 
with  smoking  history  and  squamous  cell  differentiation,  while  N3  protein  levels  in  ADCs  are  correlated  with  the  presence  of 
EGFR  mutations.  These  findings  suggest  that  NSCLC  is  commonly  associated  with  aberrations  in  the  N3-JAG1  pathway 
(Supported  by  Grant  DoD-W81XWH-05-2-0027  and  NCI-SPORE-P50CA70907). 
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Estrogen  (ER)  a  and  (5  and  progesterone  (PR)  receptors  are  transcription  factors  that  regulate  the  expression  of  multiple 
genes  and  have  been  involved  in  the  pathogenesis  of  non-small  cell  lung  cancer  (NSCLC).  There  are  conflictive  results 
reported  on  the  immunohistochemical  (IHC)  expression  of  ER  receptors  in  NSCLC  tissue  specimens  using  different 
commercially  available  antibodies  (Abs).  Correlation  between  ER  expression  and  EGFR  mutation  status  has  been  reported 
in  lung  cancer  cell  lines.  Our  aims  were  to  compare  the  IHC  expression  pattern  in  tumor  tissues  of  six  commercially 
available  ER  antibodies  (four  a-  and  two  (3)  in  a  large  set  (N=317)  of  NSCLCs  tissue  microarrays  containing  201 
adenocarcinomas  and  116  squamous  cell  carcinomas,  and  correlate  the  expression  of  ER  and  PR  with  patients’  clinical- 
pathologic  characteristics,  and  in  adenocarcinomas  with  EGFR  mutation  status.  The  Abs  used  were:  a)  ERet:  clone  6F1 1(2 
abs),  clone  HC-20  and  clone  1D5N;  b)  ER(3:  clone  H-150  and  14C8;  and,  c)  PR:  polyclonal.  ER  et  and  (3  expression  were 
detected  in  the  cytoplasm  (ERet  clone  HC-20  and  clone  lD5N)18%-42%;  ER(3  20-98%)  and  the  nucleus  (ERet  5%-36%; 
ER(3  42-56%)  of  malignant  cells,  however  the  frequency  of  expression  in  tumors  varied  between  Abs.  PR  expressed  only  at 
nuclear  level  in  63%  of  tumors.  Higher  nuclear  expression  of  most  ERet  Abs  significantly  correlated  with  adenocarcinoma 
tumor  histology,  female  gender  and  never  smoking  history.  The  Ab  raised  against  the  ERet  N-terminus  protein  (clone  1D5N, 
Lab  Vision  Co,  Fremont,  CA)  significantly  correlated  with  all  those  variables  when  stained  in  the  nucleus  and  with  worse 
recurrence  free  survival  (HR  1.77,  95%  Cl  1.12,  2.82;  P=0.015)  when  stained  in  the  cytoplasm.  Higher  nuclear  and 
cytoplasmic  expressions  of  ERet  Abs  clones  HC-20  and  1D5N,  and  higher  nuclear  expression  an  ER(3  Ab  clone  H150 
correlated  with  the  presence  of  EGFR  mutation  in  adenocarcinomas  (P=0.0029  to  <0.00001).  We  conclude  that  both  ERs 
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and  PR  express  in  NSCLC,  and  we  identified  Abs  against  ERs  that  distinguish  a  subset  on  NSCLC  having  defined  clinical- 
pathologic  features,  including  EGFR  mutation  in  lung  adenocarcinoma  histology.  Then,  we  studied  the  expression  of  ER  and 
PR  in  the  early  pathogenesis  of  lung  cancer,  and  for  ER  we  selected  one  Ab  for  each  type  of  receptor  (a  clone  1D5N;  (5 
clone  H150).  We  found  that  cytoplasmic  ER(5  and  nuclear  PR  were  frequently  expressed  in  the  normal  bronchial  and 
bronchiolar  epithelium  (n=194)  adjacent  to  lung  adenocarcinomas  as  a  field  effect  phenomenon.  Of  interest,  normal  and 
hyperplastic  alveolar  cells  did  not  express  those  receptors,  suggesting  a  cell  type-based  specific  field  abnormality.  Our 
findings  suggest  that  hormonal  receptor  antagonist  therapy  could  be  considered  in  the  development  of  novel  therapeutic  and 
chemopreventive  strategies  in  NSCLC  (Supported  by  Grant  DoD-W81XWH-04-l-0142  and  W81XWH-05-2-0027). 
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The  brain  is  one  of  the  main  metastatic  sites  for  lung  cancer,  occurring  in  ~50%  of  patients  with  non-small  cell  lung 
carcinoma  (NSCLC).  Despite  recent  advances  in  lung  cancer  targeted  therapy,  there  is  limited  information  on  the  molecular 
characteristics  of  lung  cancer  brain  metastasis.  In  NSCLC,  EGFR  gene  increased  copy  number  has  been  involved  in  tumor 
development  and  it  has  been  detected  in  ~23%  of  surgically  resected  tumors.  In  advanced  tumors,  it  has  been  associated  to 
prediction  of  response  of  EGFR  tyrosine  kinase  inhibitors.  To  better  characterize  NSCLC  brain  metastasis,  we  investigated 
the  EGFR  copy  number  by  fluorescence  in  situ  hybridization  (FISH)  in  58  paired  NSCFC  primary  tumors  (42 
adenocarcinomas,  14  squamous  cell  carcinomas,  and  2  others)  and  corresponding  brain  metastasis  using  tissue  microarrays. 
We  correlated  these  findings  with  the  immunohistochemical  expression  of  HER  family  ligands  and  receptors  in  the  same 
tumor  specimens,  including  EGF,  amphiregulin,  TGFct,  EGFR,  p-EGFR,  Her2,  Her3,  and  p-Her3.  EGFR  FISH  (+)  or 
increased  copy  number  was  defined  as  high-polysomy  and/or  gene  amplification.  Higher  frequency  of  EGFR  FISH+  was 
seen  in  brain  metastasis  (68%),  especially  gene  amplification  (29%),  compared  to  corresponding  primary  tumors  (FISH+ 
59%;  gene  amplification  17%).  Correlation  of  FISH  status  between  primary  tumor  and  corresponding  metastasis  was  78%. 
In  primary  tumors  and  brain  metastases  the  frequency  of  FISH+  was  higher  in  adenocarcinomas  (64%  and  71%, 
respectively)  compared  to  squamous  cell  carcinomas  (43%  and  57%,  respectively).  In  primary  tumors,  similar  high  level  of 
EGFR  FISH+  was  found  comparing  stages  T/II  (60%)  and  ITI/IV  (62%).  EGFR  FISH+  correlated  with  higher  expression  of 
amphiregulin  (P=0.01 1)  and  Her2  (P=0.006)  in  primary  tumors,  and  with  pEGFR  (P=0.024)  in  brain  metastases.  Primary 
tumors  with  EGFR  gene  amplification  demonstrated  the  shortest  time  for  brain  metastasis  development  after  surgical 
resection  with  curative  intent.  Our  findings  indicate  that  NSCLC  brain  metastasis  and  their  corresponding  primary  tumors 
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demonstrate  a  high  frequency  of  EGFR  increased  copy  number  by  FISH,  especially  gene  amplification,  suggesting  that  this 
molecular  abnormality  may  be  involved  in  tumor  progression  phenomenon  (Supported  by  Grant  W81XWH-05-2-0027). 
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TITF-1  and  EGFR  gene  copy  variations  are  associated  with  prognosis  for  the 
patients  with  non-small  cell  lung  cancer 

Ximing  Tan,  Diane  Liu,  Carmen  Behrens,  Dandan  He,  Menghong  Sun,  David  Rice,  J. 
Jack  Lee,  Waun  K.  Hong,  and  Ignacio  I.  Wistuba 
UT  MD  Anderson  Cancer  Center,  Houston,  TX  77030 

Thyroid  transcription  factor  -1  (TITF-1,  a  lineage-specific  transcription  factor),  and  the 
epidermal  growth  factor  receptor  (EGFR,  a  tyrosine  kinase  membrane  receptor)  have 
shown  frequent  gene  amplification  in  non-small  cell  lung  carcinomas  (NSCLC).  We 
investigated  the  clinico-pathologic  characteristics  of  NSCLCs  having  TITF-1  andor 
EGFR  gene  copy  number  abnormalities  by  examining  gene  copy  number  status  using 
quantitative  polymerase  chain  reaction  (qPCR)  and  DNA  extracted  from  microdissected 
formalin-fixed  and  paraffin-embedded  tissue  from  53  adenocarcinomas  and  29  squamous 
cell  carcinomas.  B-actin  gene  was  used  as  reference.  In  tumors,  gene  copy  ratio  referred 
to  fi-actin  ranged  from  0.22  to  74.93  (median=1.52)  for  TITF-1,  and  0.05  to  6.28 
(median=1.51)  for  EGFR.  Ratios  1  to  2  were  defined  as  normal  gene  copy  number  (NGC). 
Ratios  <1  and  >2  were  defined  as  low  gene  copy  (LGC)  and  high  gene  copy  (HGC) 
number,  respectively.  Both,  LGC  and  HGC  categories  were  defined  as  abnormal  gene 
copy.  Similar  frequencies  of  TITF-1  and  EGFR  copy  number  categories  were  detected 
comparing  adenocarcinoma  ( TITF-1 :  LGC  15,  28%;  NGC  20,  38%;  HGC  18,  34%; 

EGFR:  LGC  9,  17%;  NGC  27,  51%;  HGC,  17,  32%)  with  squamous  cell  carcinoma 
(TITF-1:  LGC  8,  28%;  NGC  14,  48%;  HGC  7,  24%;  EGFR  :  LGC  5,  17%;  NGC  18,  62%; 
HGC  6,  21%).  We  found  a  statistically  significant  correlation  between  TITF-1  and  EGFR 
copy  numbers  (Spearman  correlation  coefficients. 36,  FS.0008).  In  both  tumor 
histologies,  neither  TITF-1  nor  EGFR  gene  copy  increase  (ratio  >2)  correlated  with 
disease  prognosis.  However,  in  adenocarcinomas,  Kaplan-Meier  and  log  rank  tests 
revealed  that  the  median  time  to  death  was  longer  in  patients  with  normal  copy  number 
compared  with  those  with  abnormal  copies  for  TITF-1  (median  4.76  years,  95%  Cl 
2.95-NA,  P  =  0.04)  and  EGFR  (4.76  years,  95%  Cl  3.13-  NA,  P=0.04).  Moreover, 
adenocarcinoma  patients  with  combined  TITF-1  and  EGFR  abnormal  copy  showed  worse 
overall  survival  (3.56  years,  95%  Cl  3.13-  NA)  compared  with  patients  with  normal  copy 


status  (median  not  reached,  .P=0.003).  In  these  patients,  multicovariate  Cox  modeling 
indicated  that  combined  copy  abnormality  of  both  genes  is  an  independent  factor  for 
worse  overall  survival  (HR  4.566,  P=0.0057).  Our  findings  suggest  that  loss  and  gain  of 
TITF1  and  EGFR  are  frequent  abnormalities  in  both  adenocarcinomas  and  squamous  cell 
carcinomas  of  the  lung,  and  in  adenocarcinoma  patients  correlate  with  disease  outcome. 
(Supported  by  Grant  DoD-W81XWH-04-l-0142  and  W81XWH-05-2-0027). 

( TITF-1  and  EGFR  gene  copies  in  lung  cancer) 
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Thyroid  transcription  factor  -1  (TITF-1),  a  lineage- specific  transcription  factor  frequently  overexpressed  in  lung 
adenocarcinoma,  has  been  recently  reported  to  show  gene  amplification  in  a  subset  of  these  tumors.  To  better  characterize 
TITF-1  copy  number  in  non-small  cell  lung  carcinomas  (NSCLC)  and  correlate  it  with  protein  expression,  we  studied  both 
gene  copy  number  and  protein  expression  using  fluorescent  in  situ  hybridization  (FISH)  and  immunohistochesmistry  stain 
(IHC)  assays  in  a  large  series  (N=324)  of  surgical  resected  NSCLCs  placed  in  tissue  microarrays,  including  205 
adenocarcinomas  and  1 19  squamous  cell  carcinomas.  We  correlated  our  findings  with  patients’  clinico-pathologic 
characteristics,  and  in  a  subset  of  adenocarcinomas  with  EGFR  (exons  19-21)  and  KRAS  (exons  1  and  2)  mutation  status. 
TITF-1  amplification  (FISH+,  clustered  gene  signals)  was  detected  in  19%  (51  out  of  269)  of  tumors,  without  differences  by 
histology  (18%  of  squamous  cell  carcinoma  and  19%  of  adenocarcinoma).  TITF-1  protein  high  level  expression  (IHC+, 
semiquantitative  score  2200,  range  0-300)  was  detected  exclusively  in  adenocarcinomas  (48%  of  cases),  and  in  this  tumor 
type  correlated  with  gene  amplification  (P=0.005).  No  correlation  between  TITF1  abnormalities  and  patients’  age,  gender, 
smoking  status  and  TNM  stages  was  detected.  In  adenocarcinomas,  IHC+,  but  not  FISH+,  correlated  with  EGFR  and  KRAS 
mutations:  IHC+  was  more  frequently  found  in  EGFR  (16/21,  76%  vs.  59/172,  34%,  PcO.OOl  )  and  KRAS  (8/11,  72%  vs. 
26/75,  34%,  P=0.016)  mutant  compared  with  wild-types  tumors.  Survival  analysis  showed  that  for  adenocarcinoma  TITF-1 
FISH+  correlated  with  worse  recurrence-free  survival  (F=0.001),  while  IHC+  correlated  with  better  recurrence-free  survival 
CP=0.036).  Our  findings  indicate  that  TITF-1  amplification  occurs  in  a  subset  of  NSCLCs,  including  both  major  tumor 
histologies:  adenocarcinoma  and  squamous  cell  carcinoma.  The  association  of  TITF1  expression  with  EGFR  and  KRAS 
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mutation  in  lung  adenocarcinomas  may  correlate  with  the  peripheral  airway  origin  of  these  tumors.  Both  TITF-1  gene 
amplification  and  protein  expression  correlate  with  NSCLC  patients’  prognosis.  (Supported  by  Grant  DoD-W81XWH-04-l- 
0142  and  W81XWH-05-2-0027). 
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Category:  Advances  in  PET/SPECT  Probes 

Introduction:  Recently,  a  novel  radiotracer  [18F]F-PEG6-IPQA  was  developed  by  us  for  PET  imaging  of  active- 
mutant  EGFR  expression  lung  carcinomas.  For  preparation  of  an  1ND  for  a  clinical  Phase  I  trial,  the 
pharmacokinetics  (PK),  biodistribution,  metabolism,  and  radiation  dosimetry  of  [18F]F-PEG6-IPQA  was  studied 
in  non-human  primates. 

Methods:  The  [18F]F-PEG6-IPQA  was  synthesized  in  several  steps.  The  intermediate,  2-[2-(2-{2-[2-(2-[4-(3- 
lodophenylamino)-6-nitro-quinazoline-7-yloxy]-ethoxy]-ethoxy}-ethoxy)-ethoxy]-ethoxy}-ethanol  was  mesylated 
and  radiofluorinated  with  [18F]KF/Kryptofix222  in  DMSO  (120°C,  30  min).  Thereafter,  the  6-nitro  group  was 
reduced  to  amine  and  converted  to  an  acrylamide.  The  final  purification  of  [18F]F-PEG6-IPQA  was 
accomplished  with  HPLC  (>99%  pure).  Three  rhesus  macaques  (6-10  kg)  were  injected  intravenously  with  [18F] 
F-PEG6-IPQA  (5  mCi/animal).  Dynamic  imaging  was  performed  using  Discovery  ST8  PET/CT  (GE  Healthcare) 
for  the  first  30  min  covering  the  thoraco-abdominal  area,  followed  by  whole  body  static  images  at  60,  90, 120, 
and  180  min.  Blood  was  sampled  at  1-1.5-2-3-5-8-16-32-60-90-120-180  min,  urine  -  at  the  end  of  imaging  and 
extracted  with  3-fold  volume  of  acetonitrile.  Radiolabeled  metabolites  were  analyzed  by  HPLC.  The  PK  and 
radiation  dosimetry  estimates  were  obtained  from  blood  sampling  and  region  of  interest  analysis  (ROI)  data 
measured  on  PET/CT  images. . 

Results:  [18F]F-PEG6-1PQA  exhibited  rapid  redistribution  upon  i.v.  injection  and  a  bi-exponential  blood 
clearance  with  fast  and  slow  phase  half-times  of  ~2  and  -22  min,  respectively,  via  both  hepatobiliary  and  renal 
pathways.  The  degradation  of  [18F]F-PEG6-IPQA  to  [18F]F-PEG6  was  slow  (T1/2~4min).  The  radiation  dose- 
limiting  organs  were  liver,  upper  intestine  and  kidneys.  Lung  tissue  exhibited  low  uptake  of  [18F]F-PEG6-IPQA. 
Diagnostic  doses  up  to  20  mCi  of  [18F]F-PEG6-IPQA  can  be  safely  administered  to  human  patients. 
Conclusion:  The  pharmacokinetic  and  radiation  dosimetry  characteristics  of  [18F]F-PEG6-IPQA  should  be 
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The  epithelial-to-mesenchymal  transition  (EMT)  governs  the  radiation  sensitivity  of 
non-small  cell  lung  cancer  (NSCLC)  cells.  Yuhui  Yuan,  Jenny  Liu,  Anupama  Munshi, 
Kathleen  A.  Jensen,  Ashley  Roberts,  and  Raymond  E.  Meyn.  Department  of 
Experimental  Radiation  Oncology,  The  University  of  Texas  M.  D.  Anderson  Cancer 
Center,  Houston,  TX. 

EMT  has  important  roles  during  embryonic  development  where  it  controls  tissue 
remodeling.  EMT  is  also  known  to  be  important  in  cancer  where  mesenchymal-like 
tumor  cells  that  have  undergone  EMT  have  lost  cell-cell  junctions  and  acquired  traits 
characteristic  of  malignancy,  including  the  ability  to  invade  and  migrate.  This  loss  of  cell 
adhesion  is  due  to  a  down-regulation  of  E-cadherin  expression  at  the  level  of  gene 
transcription.  Thus,  loss  of  E-cadherin  expression  is  a  hallmark  of  EMT.  It  has  been 
reported  recently  that  the  EMT  status  of  NSCLC  cells  correlates  with  their  sensitivity  to 
inhibitors  of  the  epidermal  growth  factor  receptor  (EGFR).  Since  there  is  considerable 
interest  in  combining  such  inhibitors  with  radiotherapy  for  the  treatment  of  cancer,  we 
tested  whether  EMT  status  also  governs  radiosensitivity  of  NSCLC  cells.  A  total  of  8 
NSCLC  cell  lines  were  selected  based  on  their  status  of  E-cadherin  expression,  and  tested 
for  radiosensitivity  using  clonogenic  survival  analysis.  Four  lines  expressing  E- 
cadherin — H322,  H358,  H292  and  Calu  3 — had  surviving  fraction  at  2  Gy  (SF2)  values 
ranging  from  0.20  to  0.44.  Four  lines  negative  for  E-cadherin  expression — H1299,  A549, 
H460  and  H520 — had  SF2  values  ranging  from  0.5 1  to  0.74.  Thus,  there  was  a  general 
correlation  between  EMT  based  on  loss  of  E-cadherin  expression  and  radioresistance.  To 
further  test  this  relationship,  we  restored  E-cadherin  expression  in  HI 299  and  A549  cells 
using  an  E-cadherin  expression  vector.  Stable  clones  of  HI 299  and  A549  cells 
expressing  E-cadherin  were  isolated  and  clonogenic  survival  analysis  on  these  indicated 
that  they  were  radiosensitive  compared  to  similarly  isolated  clones  expressing  a  control 
vector.  Based  on  these  results,  we  conclude  that  EMT  may  govern  the  radiosensitivity  of 
NSCLC  cells.  Strategies  that  are  designed  to  restore  E-cadherin  expression  in  NSCLC 
that  have  undergone  EMT  may  be  useful  when  combined  with  radiotherapy  for  the 
treatment  of  this  disease. 
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Introduction:  In  gliomas,  epidermal  growth  factor  receptor  (EGFR)  amplification  and  activating  mutations  (i.e., 
EGFRvlll)  are  critical  for  oncogenesis,  maintenance  and  progression.  Positron  emission  tomography  (PET)  with 
active  EGFR  kinase-specific  radiolabeled  tracers  could  provide  the  means  for  noninvasive  imaging  of  the 
magnitude  and  spatial  heterogeneity  of  EGFR  expression  in  gliomas  in  individual  patients,  that  is  critical  for 
selection  and  monitoring  of  therapy  with  EGFR  inhibitors  and  for  prognosis.  In  this  study,  the  efficacy  of  [124l]l- 
PEG6-IPQA  PET  was  assessed  for  imaging  EGFRvlll  expression  in  intracerebral  glioblastoma  xenografts  in 
mice. 

Methods:  Nude  mice  (N=12)  were  implanted  intracerebrally  with  human  U87/GFP-Luc  cells  expressing  wild- 
type  EGFR  or  retrovirally  transduced  with  EGFRvlll  (U87GFP-Luc/EGFRvlll)  106  cell/animal.  Tumor  growth  was 
monitored  by  BLI.  Ten  days  later,  the  mice  were  injected  with  [124I]I-PEG6-IPQA  1 50  //Ci,  100//I,  i.v.)  and 
dynamic  PET  imaging  was  preformed  for  2h,  followed  by  static  imaging  at  3h.  Thereafter,  the  animals  were 
sacrificed,  brains  removed,  frozen,  and  cryosectioned  at  20pm.  Adjacent  cryosections  were  used  for 
quantitative  autoradiography  (QAR)  and  immunohistochemical  (IHC)  staining  for  EGFR  and  EGFRvlll. 
Treatment  group  (N=6)  received  EGFR  kinase  inhibitor  Iressa  (100  mg/kg,  i.p.  in  DMSO,  1h  prior  to  [124l]l- 
PEG6-IPQA  administration). 

Results:  Dynamic  PET  imaging  demonstrated  a  more  rapid  dynamics  of  [124l]l-PEG6-IPQA  accumulation  in 
U87GFP-Luc/EGFRvlll,  as  compared  to  U87/GFP-Luc  xenografts.  Iressa  treatment  reduced  2h  accumulation 
levels  of  [124I]I-PEG6-1PQA  in  U87/EGFRvlll  xenografts  two-fold  from  1 .2±0.1  to  0.63+0.04  %ID/g,  as  compared 
to  non-treated  animals.  Smaller  changes  were  observed  in  U87/GFP-Luc  xenografts  -  0.42±0.05  to  0.36+0.03 
%ID/g,  respectively.  Comparison  of  QAR  and  IHC,  demonstrated  good  spatial  correspondence  between  the 
radioactivity  distribution  and  EGFRvlll  expression. Conclusions:  PET  imaging  with  [124I]I-PEG6-IPQA 
demonstrated  efficacy  for  non-invasive  visualization  and  quantitation  of  of  wild-type  EGFR  and  active-mutant 
EGFRvlll  expression  in  intracerebral  human  glioma  xenografts  in  mice  and  could  be  applicable  for  prediction  of 
responsiveness  to  therapy  with  EGFR  kinase  inhibitors. 
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Introduction:  Response  to  EGFR  kinase  inhibitor  therapy  in  NSCLCs  is  associated  with  overexpression  and 
activating  mutations  in  the  EGFR  kinase  domain,  observed  only  in  8%  of  Caucasian  and  54%  east-asian 
patients.  Non-invasive  imaging  of  EGFR  expression-activity  in  NSCLCs  would  provide  the  means  for  prediction 
of  responses  to  therapy  with  EGFR  kinase  inhibitors.  Therefore,  we  assessed  the  efficacy  of  [124I]I-PEG6-IPQA 
for  PET  imaging  of  active-mutant  EGFR  expression  in  NSCLCs. 

Methods:  Subcutaneous  tumor  xenografts  were  established  in  immunecompromised  mice  using  four  non-small 
cell  lung  carcinoma  (NSCLC)  cell  lines  (N=6  per  line)  with  different  levels  of  wild-type  and  active-mutant 
epidermal  growth  factor  receptors  (EGFRs)  and  different  responsiveness  to  EGFR  kinase  inhibitors:  a)  H441 
(wild-type  EGFR;  resistant);  b)  PCI  4  (wild-type  EGFR;  resistant);  c)  H3255  (active-mutant  L858R  EGFR; 
sensitive);  d)  HI  975  (double-mutant  L858R  and  T790M  EGFR;  resistant).  Control  and  Iressa-treated  groups 
(prior  to  1h  imaging,  100  mg/kg  of  100 //I  in  DMSO  i.p.)  were  injected  with  [124I]I-PEG6-IPQA  150//Ci,  100//I, 
i.v.)  and  dynamic  PET  imaging  was  preformed  for  2h,  followed  by  a  static  scan  at  3h.  Thereafter,  the  animals 
were  sacrificed,  tumors  removed,  frozen,  and  cryosectioned  at  20pm.  Adjacent  cryosections  were  used  for  QAR 
and  immunohistochemical  (IHC)  staining  for  EGFR  and  phoshoEGFR. 

Result:  In  vivo  PET  and  QAR  images  obtained  at  3  hours  post  [124I]I-PEG6-IPQA  injection  demonstrated 
preferential  accumulation  of  radioactivity  in  H3255  carcinomas  expressing  the  active-mutant  EGFR  (1.4±0.1=2 
%ID/g  )  and  in  H441  carcinomas  over-expressing  the  wild-type  EGFR  (1 ,2±0.1  %ID/g).  Pre-treatment  with 
Iressa  inhibited  [124I]I-PEG6-IPQA  accumulation  in  H3255  and  H441  carcinomas  (0.64+0.04  0.75±0.06  %!D/g, 
respectively).  [124l]l-PEG6-iPQA  accumulation  levels  in  all  studied  tumor  types  correlated  positively  with  wild- 
type  pEGFR  or  active-mutant  EGFR  expression  in  corresponding  regions.  Conclusion:  [124I]I-PEG6-IPQA 
demonstrated  efficacy  for  PET  imaging  of  overexpression  of  wild-type  and  active-mutant  EGFR  in  NSCLC  in 
tumors  and  for  prediction  of  responsiveness  to  therapy  with  EGFR  kinase  inhibitors. 
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